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W fP PATENT: 129.21USU1 

NOVEL GENF UPREGULATED IN CANCERS OF THE PROSTATE 



This application claims the benefit of United States provisional patent application 
number 60/162,364, filed October 28, 1999, the entire contents of which are incorporated 
5 herein by reference. 

FTELD OF THE INVENTION 

The invention described herein relates to a novel gene and its encoded protein, 
termed 20P2H8, and to diagnostic and therapeutic methods and compositions useful in the 
10 management of various cancers which express 20P2H8, particularly including cancers of the 
bladder, prostate, colon and pancreas. 

BACKGROUND OF THE INVENTION 

Cancer is the second leading cause of human death next to coronary disease. 

15 Worldwide, millions of people die from cancer every year. In the United States alone, 
cancer causes the death of well over a half-million people each year, with some 1.4 million 
new cases diagnosed per year. While deaths from heart disease have been declining 
significantly, those resulting from cancer generally are on the rise. In the early part of the 
next century, cancer is predicted to become the leading cause of death. 

20 Worldwide, several cancers stand out as the leading killers. In particular, carcinomas 

of the lung, prostate, breast, colon, pancreas, and ovary represent the primary causes of 
cancer death. These and virtually all other carcinomas share a common lethal feature. With 
very few exceptions, metastatic disease from a carcinoma is fatal. Moreover, even for those 
cancer patients who initially survive their primary cancers, common experience has shown 

25 that their lives are dramatically altered. Many cancer patients experience strong anxieties 
driven by the awareness of the potential for recurrence or treatment failure. Many cancer 
patients experience physical debilitations following treatment. Many cancer patients 
experience a recurrence. 

Generally speaking, the fundamental problem in the management of the deadliest 
30 cancers is the lack of effective and non-toxic systemic therapies. Molecular medicine, still 
very much in its infancy, promises to redefine the ways in which these cancers are managed. 
Unquestionably, there is an intensive worldwide effort aimed at the development of novel 
molecular approaches to cancer diagnosis and treatment For example, there is a great 
interest in identifying truly tumor-specific genes and proteins that could be used as 
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diagnostic and prognostic markers and/or therapeutic targets or agents. Research efforts in 
these areas are encouraging, and the increasing availability of useful molecular technologies 
has accelerated the acquisition of meaningful knowledge about cancer. Nevertheless, 
progress is slow and generally uneven. 

5 As discussed below, the management of prostate cancer serves as a good example of 

the limited extent to which molecular biology has translated into real progress in the clinic. 
With limited exceptions, the situation is more or less the same for the other major 
carcinomas mentioned above. 

Worldwide, prostate cancer is the fourth most prevalent cancer in men. In North 

10 America and Northern Europe, it is by far the most common male cancer and is the second 
leading cause of cancer death in men. In the United States alone, well over 40,000 men die 
annually of this disease - second only to lung cancer. Despite the magnitude of these 
figures, there is still no effective treatment for metastatic prostate cancer. Surgical 
prostatectomy, radiation therapy, hormone ablation therapy, and chemotherapy remain fixed 

15 as the main treatment modalities. Unfortunately, these treatments are ineffective for many 
and are often associated with significant undesirable consequences. 

On the diagnostic front, the lack of a prostate tumor marker that can accurately 
detect early-stage, localized tumors remains a significant limitation in the management of 
this disease. Although the serum PSA assay has been a very useful tool, its specificity and 

20 general utility is widely regarded as lacking in several important respects, as further discussed 
below. Most prostate cancers initially occur in the peripheral zone of the prostate gland, 
away from the urethra. Tumors within this zone may not produce any symptoms and, as a 
result, most men with early-stage prostate cancer will not present clinical symptoms of the 
disease until significant progression has occurred. Tumor progression into the transition 

25 zone of the prostate may lead to urethral obstruction, thus producing the first symptoms of 
the disease. However, these clinical symptoms are indistinguishable from the common non- 
malignant condition of benign prostatic hyperplasia (BPH). Early detection and diagnosis 
of prostate cancer currently relies on digital rectal examinations (DRE), prostate specific 
antigen (PSA) measurements, transrectal ultrasonography (TRUS), and transrectal needle 

30 biopsy (TRNB). At present, serum PSA measurement in combination with DRE represent 
the leading tool used to detect and diagnose prostate cancer. Both have major limitations 
which have fueled intensive research into finding better diagnostic markers of this disease. 

Similarly, there is no available marker that can predict the emergence of the typically 
fatal metastatic stage of prostate cancer. Diagnosis of metastatic stage is presently achieved 



by open surgical or laparoscopic pelvic lymphadenectomy, whole body radionuclide scans, 
skeletal radiography, and/or bone lesion biopsy analysis. Clearly, better imaging and other 
less invasive diagnostic methods offer the promise of easing the difficulty those procedures 
place on a patient, as well as improving diagnostic accuracy and opening therapeutic 

5 options. A similar problem is the lack of an effective prognostic marker for determining 
which cancers are indolent and which ones are or will be aggressive. PSA, for example, fails 
to discriminate accurately between indolent and aggressive cancers. Until there are prostate 
tumor markers capable of reliably identifying early-stage disease, predicting susceptibility to 
metastasis, and precisely imaging tumors, the management of prostate cancer will continue 

10 to be extremely difficult. 

PSA is the most widely used tumor marker for screening, diagnosis, and monitoring 
prostate cancer today. In particular, several immunoassays for the detection of serum PSA 
are in widespread clinical use. Recently, a reverse transcriptase-polymerase chain reaction 
(RT-PCR) assay for PSA mRNA in serum has been developed. However, PSA is not a 

15 disease-specific marker, as elevated levels of PSA are detectable in a large percentage of 
patients with BPH and prostatitis (25-86%)(Gao et al., 1997, Prostate 31: 264-281), as well 
as in other nonmalignant disorders and in some normal men, a factor which significandy 
limits the diagnostic specificity of this marker. For example, elevations in serum PSA of 
between 4 to 10 ng/ml are observed in BPH, and even higher values are observed in 

20 prostatitis, particularly acute prostatitis. BPH is an extremely common condition in men. 
Further confusing the situation is the fact that serum PSA elevations may be observed 
without any indication of disease from DRE, and visa-versa. Moreover, it is now recognized 
that PSA is not prostate- specific (Gao et al, supra, for review). 

Various methods designed to improve the specificity of PSA-based detection have 

25 been described, such as measuring PSA density and the ratio of free vs. complexed PSA. 
However, none of these methodologies have been able to reproducibly distinguish benign 
from malignant prostate disease. In addition, PSA diagnostics have sensitivities of between 
57-79% (Cupp & Osterling, 1993, Mayo Clin Proc 68:297-306), and thus miss identifying 
prostate cancer in a significant population of men with the disease. 

30 There are some known markers which are expressed predominantly in prostate, such 

as prostate specific membrane antigen (PSM), a hydrolase with 85% identity to a rat 
neuropeptide (Carter et al., 1996, Proc. Natl. Acad. Sci. USA 93: 749; Bzdega et al., 1997, 
J. Neurochem. 69: 2270). However, the expression of PSM in small intestine and brain 
(Israeli et al., 1994, Cancer Res. 54: 1807), as well its potential role in neuropeptide 
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catabolism in brain, raises concern of potential neurotoxicity with anti-PSM therapies. 
Preliminary results using an Indium-Ill labeled, anti-PSM monoclonal antibody to image 
recurrent prostate cancer show some promise (Sodee et al., 1996, Clin Nuc Med 21: 759- 
766). More recently identified prostate cancer markers include PCTA-1 (Su et al, 1996, 

5 Proc. Natl. Acad. Sci. USA 93: 7252) and prostate stem cell antigen (PSCA) (Reiter et al., 
1998, Proc. Nad. Acad. Sci. USA 95: 1735). PCTA-1, a novel galectin, is largely secreted 
into the media of expressing cells and may hold promise as a diagnostic serum marker for 
prostate cancer (Su et al, 1996). PSCA, a GPI-linked cell surface molecule, was cloned 
from LAPC-4 cDNA and is unique in that it is expressed primarily in basal cells of normal 

10 prostate tissue and in cancer epithelia (Reiter et al., 1998). Vaccines for prostate cancer are 
also being actively explored with a variety of antigens, including PSM and PSA. 

SUMMARY OF THE INVENTION 

The present invention relates to the gene designated 20P2H8, which is over- 

15 expressed in cancers including cancer of the prostate. Northern blot expression analysis of 
20P2H8 gene expression in normal tissues shows a predominant transcript of about 4.4 Kb 
that is highly expressed in pancreas, and is also expressed in prostate, colon and placenta. 
The predicted molecular weight of the 20P2H8 protein is 57.4 kD and its' pi is 7.7. In 
addition, expression analysis demonstrates high levels of 20P2H8 expression in several 

20 prostate and other cancer cell lines as well as prostate, bladder, kidney and breast cancer 
patient samples and tumor xenografts. The expression profile of 20P2H8 in normal adult 
tissues, combined with the over-expression observed in cancer cells such as pancreas, 
bladder, testis, lung, ovary and prostate cancer cell lines and/or cancer patient samples, 
provides evidence that 20P2H8 is aberrantly expressed in at least some cancers, and can 

25 serve as a useful diagnostic and/ or therapeutic target for such cancers. 

The invention provides polynucleotides corresponding or complementary to all or 
part of the 20P2H8 genes, mRNAs, and/or coding sequences, preferably in isolated form, 
including polynucleotides encoding 20P2H8 proteins and fragments thereof, DNA, RNA, 
DNA/RNA hybrid, and related molecules, polynucleotides or oligonucleotides 

30 complementary to the 20P2H8 genes or mRNA sequences or parts thereof, and 
polynucleotides or oligonucleotides that hybridize to the 20P2H8 genes, mRNAs, or to 
20P2H8-encoding polynucleotides. Also provided are means for isolating cDNAs and the 
genes encoding 20P2H8. Recombinant DNA molecules containing 20P2H8 polynucleotides, 
cells transformed or transduced with such molecules, and host-vector systems for the 
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expression of 20P2H8 gene products are also provided. The invention further provides 
20P2H8 proteins and polypeptide fragments thereof. The invention further provides 
antibodies that bind to 20P2H8 proteins and polypeptide fragments thereof, including 
polyclonal and monoclonal antibodies, murine and other mammalian antibodies, chimeric 
5 antibodies, humanized and fully human antibodies, and antibodies labeled with a detectable 
marker. 

The invention further provides methods for detecting the presence and status of 
20P2H8 polynucleotides and proteins in various biological samples, as well as methods for 
identifying cells that express 20P2H8. A typical embodiment of this invention provides 
10 methods for monitoring 20P2H8 gene products in a tissue sample having or suspected of 
having some form of growth dysregulation such as cancer. 

The invention further provides various therapeutic compositions and strategies for 
treating cancers that express 20P2H8 such as prostate cancers, including therapies aimed at 
inhibiting the transcription, translation, processing or function of 20P2H8 as well as cancer 
15 vaccines. 



BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A-1E show the nucleotide (SEQ ID NO: 1) and deduced amino acid (SEQ 
ID NO: 2) sequences of 20P2H8 cDNA. The start methionine and putative Kozak 

20 sequence are indicated in bold, RNA-binding domains RNP1 and RNP2 (shaded) are boxed, 
3 Proline-rich regions are in bold. 

Figures 2A-2C show a semi-quantitative RT-PCR analysis of 20P2H8 gene 
expression in a panel of 16 normal human tissues (Panels B and C) and several prostate 
cancer xenografts (Panel A). Lanes 1-8 in panel A correspond to RT-PCR analysis of 

25 20P2H8 gene expression in brain, prostate, LAPC-4 AD, LAPC-4 AI, LAPC-9 AD, HeLa, 
mouse mix and a negative control respectively. Lanes 1-8 in panel B correspond to RT- 
PCR analysis of 20P2H8 gene expression in brain, heart, kidney, liver, lung, pancreas, 
placenta and skeletal muscle respectively. Lanes 1-8 in panel C correspond to RT-PCR 
analysis of 20P2H8 gene expression in colon, ovary, leukocytes, prostate, small intestine, 

30 spleen, testis and thymus respectively. 

Figures 3A-3C show a Northern blot analysis of human 20P2H8 expression in 
various normal tissues showing predominant expression in pancreas with significant 
expression also detected in prostate and colon. Lanes 1-8 in panel A correspond to Nothern 
analysis of 20P2H8 gene expression in heart, brain, placenta, lung, liver, skeletal muscle, 

5 
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kidney and pancreas respectively. Lanes 1-8 in panel B correspond to Nothern analysis of 
20P2H8 gene expression in spleen, thymus, prostate, testis, ovary, small intestine, colon and 
leukocytes respectively. Lanes 1-5 in panel C correspond to Nothern analysis of 20P2H8 
gene expression in prostate, LAPC-4 AD, LAPC-4 AI, LAPC-9AD and LAPC-9 AI 
5 respectively. 

Figure 4 shows a Northern blot analysis of human 20P2H8 mRNA expression in a 
panel of human colon and pancreatic cancer cell lines. 

Figures 5A-5B show a Northern expression analysis of 20P2H8 mRNA in many 
cancer lines including bladder, lung, breast, testicular cervical and ovarian cancers. 

10 Figure 6 illustrates an RT-PCR analysis showing expression of the of 20P2H8 

mRNA in bladder cancer, colon cancer, and lung cancer patients. 

Figures 7A-7B show expression of recombinant 20P2H8 protein. A. 293T cells 
were transiently transfected with either pCDNA3.1 MycHis tagged 20P2H8 plasmid or with 
empty control vector and harvested 2 days later. Cells were lysed in SDS-PAGE sample 

15 buffer and lysates were separated on a 10-20% SDS-PAGE gel and then transferred to 
nitrocellulose. The blot was blocked in Tris-buffered saline (TBS)+ 2% non-fat milk and 
then probed with a 1:3,000 dilution of rabbit anti-His pAb (Santa Cruz Biotechnology Inc.) 
in TBS+0.15% Tween-20+ 1% milk. The blot was washed and then incubated with a 
1:4,000 dilution of anti-rabbit HRP conjugate secondary antibody. Following washing, anti- 

20 His immunoreactive bands were developed by enhanced chemiluminescence and visualized 
by exposure to autoradiographic film. Indicated by arrow is a specific anti-His 
immunoreactive band of approximately 60Kd that corresponds to expression of the 
Myc/His-tagged 20P2H8 protein in the transfected cells. B. Cell lysates of 20P2H8 mRNA 
negative 293T cells and of 20P2H8 mRNA positive Colo 205 cells were separated by SDS- 

25 PAGE and subjected to Western blot analysis as indicated above but using an anti-20P2H8 
affinity purified polyclonal antibody (4 ug/ml). Indicated with arrows are anti-20P2H8 
immunoreactive bands of approximately 58 kD and 30 kD that appear in 20P2H8 mRNA 
positive Colo 205 cells but not in 20P2H8 mRNA negative 293T cells. 

Figure 8 shows a Northern analysis from tissues from patients with bladder cancer 

30 and unaffected individuals. Northern analysis was performed on bladder cancer and their 
adjacent normal matched tissues obtained from four bladder cancer patients. For this 
experiment, lOjag of total RNA was loaded for each sample. Overexpression of 20P2H8 
was seen in 3 out of 4 tumor samples tested. No expression was observed in any of the 3 
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normal matched tissues tested, nor in bladder isolated from a normal individual The data 
demonstrate specific expression of 20P2H8 in cancer but not in normal bladder tissues. 

Figure 9 shows a RNA dot blot analysis from tissues from patients with various 
cancers. Expression of 20P2H8 was assayed in a panel of human cancers (T) and their 
5 respective matched normal tissues (N) on RNA dot blots. 20P2H8 overexpression was 
detected in 4 out of 14 kidney cancers, 7 out of 9 breast cancers 1 out of 3 prostate cancers, 
2 out of 3 ovarian cancers, 1 out of 1 cervical cancer, and 4 out of 8 stomach cancers. 
20P2H8 was also found to be highly expressed in the melanoma cell line G361 and the 
colorectal adenocarcinoma line SW480. 

10 

DETAILED DESCRIPTION OF THE INVENTION 

Unless otherwise defined, all terms of art, notations and other scientific terminology 
used herein are intended to have the meanings commonly understood by those of skill in 
the art to which this invention pertains. In some cases, terms with commonly understood 

15 meanings are defined herein for clarity and/or for ready reference, and the inclusion of such 
definitions herein should not necessarily be construed to represent a substantial difference 
over what is generally understood in the art. The techniques and procedures described or 
referenced herein are generally well understood and commonly employed using 
conventional methodology by those skilled in the art, such as, for example, the widely 

20 utilized molecular cloning methodologies described in Sambrook et aL, Molecular Cloning: 
A Laboratory Manual 2nd. edition (1989) Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y. As appropriate, procedures involving the use of commercially 
available kits and reagents are generally carried out in accordance with manufacturer defined 
protocols and/ or parameters unless otherwise noted. 

25 As used herein, the terms "advanced prostate cancer 55 , "locally advanced prostate 

cancer", "advanced disease 55 and "locally advanced disease 55 mean prostate cancers which 
have extended through the prostate capsule, and are meant to include stage C disease under 
the American Urological Association (AUA) system, stage CI - C2 disease under the 
Whitmore-Jewett system, and stage T3 - T4 and N+ disease under the TNM (tumor, node, 

30 metastasis) system. In general, surgery is not recommended for patients with locally 
advanced disease, and these patients have substantially less favorable outcomes compared to 
patients having clinically localized (organ-confined) prostate cancer. Locally advanced 
disease is clinically identified by palpable evidence of induration beyond the lateral border of 
the prostate, or asymmetry or induration above the prostate base. Locally advanced prostate 

7 
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cancer is presently diagnosed pathologically following radical prostatectomy if the tumor 
invades or penetrates the prostatic capsule, extends into the surgical margin, or invades the 
seminal vesicles. 

As used herein, the terms "metastatic prostate cancer" and "metastatic disease" 
5 mean prostate cancers which have spread to regional lymph nodes or to distant sites, and 
are meant to include stage D disease under the AUA system and stage TxNxM+ under the 
TNM system. As is the case with locally advanced prostate cancer, surgery is generally not 
indicated for patients with metastatic disease, and hormonal (androgen ablation) therapy is 
the preferred treatment modality. Patients with metastatic prostate cancer eventually 
10 develop an androgen-refractory state within 12 to 18 months of treatment initiation, and 
approximately half of these patients die within 6 months thereafter. The most common site 
for prostate cancer metastasis is bone. Prostate cancer bone metastases are, on balance, 
characteristically osteoblastic rather than osteolytic (i.e., resulting in net bone formation). 
Bone metastases are found most frequently in the spine, followed by the femur, pelvis, rib 
15 cage, skull and humerus. Other common sites for metastasis include lymph nodes, lung, 
liver and brain. Metastatic prostate cancer is typically diagnosed by open or laparoscopic 
pelvic lymphadenectomy, whole body radionuclide scans, skeletal radiography, and/ or bone 
lesion biopsy. 

As used herein, the term "polynucleotide" means a polymeric form of nucleotides of 
20 at least about 10 bases or base pairs in length, either ribonucleotides or deoxynucleotides or 
a modified form of either type of nucleotide, and is meant to include single and double 
stranded forms of DNA. 

As used herein, the term "polypeptide" means a polymer of at least about 6 amino 
acids. Throughout the specification, standard three letter or single letter designations for 
25 amino acids are used. 

As used herein, the terms "hybridize", "hybridizing", "hybridizes" and the like, used 
in the context of polynucleotides, are meant to refer to conventional hybridization 
conditions, preferably such as hybridization in 50% formamide/6XSSC/0.1% SDS/100 
(Ig/ml ssDNA, in which temperatures for hybridization are above 37 degrees C and 
30 temperatures for washing in 0.1X SSC/0.1% SDS are above 55 degrees C, and most 
preferably to stringent hybridization conditions. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary 
skill in the art, and generally is an empirical calculation dependent upon probe length, 
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washing temperature, and salt concentration. In general, longer probes require higher 
temperatures for proper annealing, while shorter probes need lower temperatures. 
Hybridization generally depends on the ability of denatured DNA to reanneal when 
complementary strands are present in an environment below their melting temperature. The 

5 higher the degree of desired homology between the probe and hybridizable sequence, the 
higher the relative temperature that can be used. As a result, it follows that higher relative 
temperatures would tend to make the reaction conditions more stringent, while lower 
temperatures less so. For additional details and explanation of stringency of hybridization 
reactions, see Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience 

10 Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, may be 
identified by those that: (1) employ low ionic strength and high temperature for washing, for 
example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 
50°C; (2) employ during hybridization a denaturing agent, such as formamide, for example, 

15 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% 
polyvinylpyrroUdone/50mM sodium phosphate buffer at pH 6.5 with 750 mM sodium 
chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M 
NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium 
pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 (ig/ml), 0.1% 

20 SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium 
chloride/sodium, citrate) and 50% formamide at 55°C, followed by a high-stringency wash 
consisting of 0.1 x SSC containing EDTA at 55°C. 

"Moderately stringent conditions" may be identified as described by Sambrook et al., 
1989, Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, and 

25 include the use of washing solution and hybridization conditions (e.g., temperature, ionic 
strength and %SDS) less stringent than those described above. An example of moderately 
stringent conditions is overnight incubation at 37°C in a solution comprising: 20% 
formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/mL denatured sheared 

30 salmon sperm DNA, followed by washing the filters in 1 x SSC at about 37-50°C. The 
skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary 
to accommodate factors such as probe length and the like. 
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In the context of amino acid sequence comparisons, the term "identity" is used to 
express the percentage of amino acid residues at the same relative positions that are the 
same. Also in this context, the term "homology" is used to express the percentage of amino 
acid residues at the same relative positions that are either identical or are similar, using the 

5 conserved amino acid criteria of BLAST analysis, as is generally understood in the art. For 
example, % identity values may be generated by WU-BLAST-2 (Altschul et al., 1996, 
Methods in Enzymology 266:460-480; http://blast.wusd/edu/blast/README.html). 
Further details regarding amino acid substitutions, which are considered conservative under 
such criteria, are provided below. Additional definitions are provided throughout the 

10 subsections that follow. 

As discussed in detail below, the present invention relates to a novel protein 
designated 20P2H8 which shares homology with several heterogenous nuclear 
ribonucleoproteins (hnRNPs). A full length approximately 3600 bp 20P2H8 cDNA (SEQ 
ID NO: 1), encoding a 517 amino acid open reading frame (SEQ ID NO: 2), is provided 

15 herein. The predicted 20P2H8 protein exhibits five RNA binding sequences, two of which 
correspond to ribonucleoprotein-l (RNP1) consensus sites, and three of which correspond 
to RNP2 sites. In addition, 20P2H8 contains three regions with significant proline content 
(30-42%), which he outside regions of homology with hnRNPs. These proline rich regions 
may be involved in protein-protein interactions as has been observed in other proteins 

20 (Schlessenger et al., 1994, Curr. Opin. Genet. Dev. 4: 25). The 20P2H8 protein structure 
shows highest homology with a protein identified in C. elegans (Genbank CAA92704), and 
significant homology is also seen with various hnRNPs involved in mRNA splicing (ROF, 
ROH1 and ROH2; Honore et al, 1995, J. Biol. Chem. 270: 28780). Accordingly, based on 
these structural features and homologies, it is likely that the 20P2H8 protein is involved in 

25 RNA splicing. 

Expression analysis by northern blot shows highest expression levels of a single 4.4 
kb 20P2H8 transcript in pancreas, and is also expressed in prostate, colon and placenta. No 
other normal tissues in this panel show detectable expression. Expression of 20P2H8 is up- 
regulated in human prostate tumor xenografts derived from metastatic prostate cancer 
30 (compared to normal prostate). Further, analysis of 20P2H8 expression in multiple cancer 
cell lines reveals high levels of expression in several pancreatic (BxPC-3, HPAC, Capan-1) 
and colon (CaCo-2, T84, Colo-205) cancer cell lines. The up-regulated expression of 
20P2H8 in cancers including cancers of the bladder, kidney, prostate, pancreas and colon 
indicates that the 20P2H8 gene, message and protein allows 20P2H8 to be used as a 
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diagnostic, staging and/or prognostic marker for cancers of the prostate, colon and 
pancreas, and/or may also serve as a target for various approaches to the treatment of these 
cancers. 

The invention provides polynucleotides corresponding or complementary to all or 

5 part of the 20P2H8 genes, mRNAs, and/or coding sequences, preferably in isolated form, 
including polynucleotides encoding 20P2H8 proteins and fragments thereof, DNA, RNA, 
DNA/RNA hybrid, and related molecules, polynucleotides or oligonucleotides 
complementary to the 20P2H8 genes or mRNA sequences or parts thereof, and 
polynucleotides or oligonucleotides which hybridi2e to the 20P2H8 genes, mRNAs, or to 

10 20P2H8-encoding polynucleotides. Also provided are means for isolating cDNAs and the 
genes encoding 20P2H8. Recombinant DNA molecules containing 20P2H8 polynucleotides, 
cells transformed or transduced with such molecules, and host-vector systems for the 
expression of 20P2H8 gene products are also provided. The invention further provides 
20P2H8 proteins and polypeptide fragments thereof. The invention further provides 

15 antibodies that bind to 20P2H8 proteins and polypeptide fragments thereof, including 
polyclonal and monoclonal antibodies, murine and other mammalian antibodies, chimeric 
antibodies, humanized and fully human antibodies, antibodies labeled with a detectable 
marker, and antibodies conjugated to radionuclides, toxins or other therapeutic 
compositions. The invention further provides methods for detecting the presence of 

20 20P2H8 polynucleotides and proteins in various biological samples, as well as methods for 
identifying cells that express a 20P2H8. The invention further provides various therapeutic 
compositions and strategies for treating cancers which express 20P2H8 such as prostate and 
bladder cancers, including antibody, vaccine and small molecule therapy, and therapies aimed at 
inhibiting the transcription, translation, processing or function of 20P2H8. 

25 

MOLECULAR BIOLOGY OF 20P2H8 

As is further described in the Examples that follow, the 20P2H8 gene and protein 
have been characterized using a number of analytical approaches. For example, analyses of 
nucleotide coding and amino acid sequences were conducted in order to identify potentially 
30 related molecules, as well as recognizable structural domains, topological features, and other 
elements within the 20P2H8 mRNA and protein structure. Northern blot analyses of 
20P2H8 mRNA expression were conducted in order to establish the range of normal and 
cancerous tissues expressing 20P2H8 message. 
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The isolated 20P2H8 cDNA (SEQ ID NO: 1) provided herein (FIG. 1) and 
corresponding gene are predicted to encode a 517 amino acid protein (SEQ ID NO: 2) with 
various structural features common to heterogenous nuclear ribonucleoproteins (hnRNPs) 
involved in RNA splicing, including five RNA binding sequences (two corresponding to 
ribonucleoprotein-1 (RNP1) and 3 corresponding to ribonucleoprotein-2 (RNP2). The 
protein exhibits significant homology to a C. elegans protein, designated CAA92704 
(approximately 52.4% identity in a 311 residues overlap beginning with 20P2H8 amino acid 
residue 63 as shown in FIG. 1), as well as several hnRNPs involved in RNA splicing (e.g., 
ROF, ROH1 and ROH; Honore et al., 1995, J. Biol. Chem. 270: 28780). In addition, the 
20P2H8 protein contains three regions with significant proline content (30-42%), which lie 
outside its regions of homology with hnRNPs. These proline rich regions may be involved 
in protein-protein interactions as has been observed in other proteins (Schlessenger et al, 
1994, Curr. Opin. Genet Dev. 4: 25). 

The 20P2H8 gene is normally expressed predominantly in pancreas, with lower 
levels of expression occurring in prostate, colon and placenta (FIGS. 2 and. 3), but is also 
expressed or over-expressed in a number of human cancers, including cancers of the 
prostate, kidney, skin, stomach, cervix, bladder, testis, ovaries, breast, pancreas, colon and 
lung (see e.g. FIGS. 4-8). 

20P2H8 overexpression in the prostate cancer xenografts as well as kidney, stomach, 
breast and is an indication that this molecule is deregulated in prostate cancer. Therefore, 
20P2H8 target for prostate cancer diagnosis and therapy. For example, interfering with 
20P2H8 function, using an antibody and/or a small molecule, in prostate cancer cells may 
prevent de-differentiation and/or proliferation of cells. In addition, small molecules and/ or 
specific antibodies may interfere with 20P2H8 function making this molecule a candidate 
for vaccine methodologies. Investigating 20P2H8 function may also lead to identification of 
other potential targets. 

20P2H8 function can be assessed in mammalian cells using a variety of techniques 
that are well known in the art. For mammalian expression, 20P2H8 can be cloned into 
several vectors, including pcDNA 3.1 myc-His-tag (Invitrogen)and the retroviral vector 
pSRoctkneo (Mullet et al., 1991, MCB 11:1785). Using these expression vectors, 20P2H8 
can be expressed in several cell lines, including PC-3, NIH 3T3, LNCaP and 293T. 
Expression of 20P2H8 can be monitored using northern blot analysis. 
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The mammalian cell lines expressing 20P2H8 can be tested in several in vitro and in 
vivo assays, including cell proliferation in tissue culture, activation of apoptotic signals, 
tumor formation in SCID mice, and in vitro invasion using a membrane invasion culture 
system (MICS) (Welch et al., Int. J. Cancer 43: 449-457). The 20P2H8 cell phenotype can 

5 be compared to the phenotype of cells that lack expression of 20P2H8. 

As disclosed herein, 20P2H8 exhibits specific properties that are analogous to those 
found in a family of genes whose polynucleotides, polypeptides and anti-polypeptide 
antibodies are used in well known diagnostic assays directed to examining conditions 
associated with dysregulated cell growth such as cancer, in particular prostate cancer (see e.g. 

10 both its highly specific pattern of tissue expression as well as its overexpression in prostate 
cancers as described for example in Example 3). The best known member of this class is 
PSA, the archetypal marker that has been used by medical practitioners for years to identify 
and monitor the presence of prostate cancer (see e.g. Merrill et al., J. Urol. 163(2): 503-5120 
(2000); Polascik et al., J. Urol. Aug;162(2):293-306 (1999) and Fortier et al, J. Nat Cancer 

15 Inst. 91(19): 1635-1640(1999)). A variety of other diagnostic markers are also used in this 
context including p53 and K-ras (see e.g. Tulchinsky et al, IntJ Mol Med 1999 Jul;4(l):99- 
102 and Minimoto et al, Cancer Detect Prev 2000;24(1):1-12). Consequendy, this 
disclosure of the 20P2H8 polynucleotides and polypeptides (as well as the 20P2H8 
polynucleotide probes and anti-20P2H8 antibodies used to identify the presence of these 

20 molecules) and their properties allows skilled artisans to utilize these molecules in methods 
that are analogous to those used, for example, in a variety of diagnostic assays directed to 
examining conditions associated with cancer. 

Typical embodiments of diagnostic methods which utilize the 20P2H8 
polynucleotides, polypeptides and antibodies described herein are analogous to those 

25 methods from well established diagnostic assays which employ PSA polynucleotides, 
polypeptides and antibodies. For example, just as PSA polynucleotides are used as probes 
(for example in Northern analysis, see e.g. Sharief et al, Biochem. Mol. Biol. Int. 33(3):567- 
74(1994)) and primers (for example in PCR analysis, see e.g. Okegawa et al, J. Urol. 163(4): 
1189-1190 (2000)) to observe the presence and/or the level of PSA mRNAs in methods of 

30 monitoring PSA overexpression or the metastasis of prostate cancers, the 20P2H8 
polynucleotides described herein can be utilized in the same way to detect 20P2H8 
overexpression or the metastasis of prostate and other cancers expressing this gene. 
Alternatively, just as PSA polypeptides are used to generate antibodies specific for PSA 
which can then be used to observe the presence and/or the level of PSA proteins in 
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methods of monitoring PSA protein overexpression (see e.g. Stephan et al. Urology 
55(4):560-3 (2000)) or the metastasis of prostate cells (see e.g. Alanen et aL, Pathol. Res. 
Pract. 192(3):233-7 (1996)), the 20P2H8 polypeptides described herein can be utilized to 
generate antibodies for use in detecting 20P2H8 overexpression or the metastasis of 
prostate cells and cells of other cancers expressing this gene. 

Specifically, because metastases involves the movement of cancer cells from an 
organ of origin (such as the bladder, kidney or prostate gland etc.) to a different area of the 
body (such as a lymph node), assays which examine a biological sample for the presence of 
cells expressing 20P2H8 polynucleotides and/or polypeptides can be used to provide 
evidence of metastasis, for example, when a biological sample from tissue that does not 
normally contain 20P2H8 expressing cells (lymph node) is found to contain 20P2H8 
expressing cells such as the 20P2H8 expression seen in LAPC4 and LAPC9, xenografts 
isolated from lymph node and bone metastasis respectively. Alternatively 20P2H8 
polynucleotides and/or polypeptides can be used to provide evidence of cancer, for 
example, when a cells in biological sample that do not normally express 20P2H8 or express 
20P2H8 at a different level (such as kidney, bladder, lung and prostate cells etc.) are found 
to express 20P2H8 or have an increased expression of 20P2H8. In such assays, artisans may 
further wish to generate supplementary evidence of metastasis by testing the biological 
sample for the presence of a second tissue restricted marker (in addition to 20P2H8) such as 
PSA, PSCA etc. (see e.g. Alanen et al, Pathol. Res. Pract. 192(3): 233-237 (1996)). 

Just as PSA polynucleotide fragments and polynucleotide variants are employed by 
skilled artisans for use in methods of monitoring this molecule, 20P2H8 polynucleotide 
fragments and polynucleotide variants can also be used in an analogous manner. In 
particular, typical PSA polynucleotides used in methods of monitoring this molecule are 
probes or primers which consist of fragments of the PSA cDNA sequence. Illustrating this, 
primers used to PCR amplify a PSA polynucleotide must include less than the whole PSA 
sequence to function in the polymerase chain reaction. In the context of such PCR 
reactions, skilled artisans generally create a variety of different polynucleotide fragments that 
can be used as primers in order to amplify different portions of a polynucleotide of interest 
or to optimize amplification reactions (see e.g. Caetano-Anolles, G. Biotechniques 25(3): 
472-476, 478-480 (1998); Robertson et al. Methods Mol. Biol. 98:121-154 (1998)). An 
additional illustration of the utility of such fragments is provided in Example 3, where a 
20P2H8 polynucleotide fragment is used as a probe to show the overexpression of 20P2H8 
mRNAs in cancer cells. In addition, in order to facilitate their use by medical practitioners, 
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valiant polynucleotide sequences ate typically used as primers and probes for the 
corresponding mRNAs in PCR and Northern analyses (see e.g. Sawai et al., Fetal Diagn. 
Ther. 1996 Nov-Dec;ll(6):407-13 and Current Protocols In Molecular Biology, Volume 2, 
Unit 2, Frederick M. Ausubul et al. eds., 1995)). Polynucleotide fragments and variants are 
5 typically useful in this context as long as they have the common attribute or characteristic of 
being capable of binding to a target polynucleotide sequence (e.g. the 20P2H8 
polynucleotide shown in SEQ ID NO: 1) under conditions of high stringency. 

Just as PSA polypeptide fragments and polypeptide variants are employed by skilled 
artisans for use in methods of monitoring this molecule, 20P2H8 polypeptide fragments and 
10 polypeptide variants can also be used in an analogous manner. In particular, typical PSA 
polypeptides used in methods of monitoring this molecule are fragments of the PSA protein 
which contain an epitope that can be recognized by an antibody which will specifically bind 
to the PSA protein. This practice of using polypeptide fragments or polypeptide variants 
used to generate antibodies (such as anti-PSA antibodies) is typical in the art with a wide 
15 variety of systems such as fusion proteins being used by practitioners (see e.g. Current 
Protocols In Molecular Biology, Volume 2, Unit 16, Frederick M. Ausubul et al. eds., 1995). 
In this context, each of the variety of epitopes in a protein of interest functions to provide 
the architecture upon which the antibody is generated. Typically, skilled artisans generally 
create a variety of different polypeptide fragments that can be used in order to generate 
antibodies specific for different portions of a polypeptide of interest (see e.g. U.S. Patent 
No. 5,840,501 and U.S. Patent No. 5,939,533). For example it may be preferable to utilize a 
polypeptide comprising one of the 20P2H8 biological motifs discussed below. Polypeptide 
fragments and variants are typically useful in this context as long as they have the common 
attribute or characteristic of having an epitope capable of generating an antibody specific for 
25 a target polypeptide sequence (e.g. the 20P2H8 polypeptide shown in SEQ ID NO: 2). 

As shown herein, the 20P2H8 polynucleotides and polypeptides (as well as the 
20P2H8 polynucleotide probes and anti-20P2H8 antibodies used to identify the presence of 
these molecules) exhibit specific properties that make them useful in diagnosing cancers of 
the prostate. The described diagnostic assays that measures the presence of 20P2H8 gene 
products, in order to evaluate the presence or onset of the particular disease conditions 
described herein such as prostate cancer are particularly useful in identifying potential 
candidates for preventive measures or further monitoring, as has been done so successfully 
with PSA. Moreover, these materials satisfy a need in the art for molecules having similar 
characteristics to PSA in situations where, for example, a definite diagnosis of metastasis of 



20 



30 



15 



o 

prostatic origin cannot be made on the basis of a testing for PSA alone (see e.g. Alanen et 
al, Pathol. Res. Pract. 192(3): 233-237 (1996)), and consequendy, materials such as 20P2H8 
polynucleotides and polypeptides (as well as the 20P2H8 polynucleotide probes and anti- 
20P2H8 antibodies used to identify the presence of these molecules) must be employed to 
confirm metastases of prostatic origin. 

Finally, in addition to their use in diagnostic assays, the 20P2H8 polynucleotides 
disclosed herein have a number of other specific utilities such as their use in the 
identification of oncogenetic associated chromosomal abnormalities in 15q22. Moreover, in 
addition to their use in diagnostic assays, the 20P2H8 polypeptides and polynucleotides 
disclosed herein have other utilities such as their use in the forensic analysis of tissues of 
unknown origin (see e.g. Takahama K Forensic Sci Int 1996 Jun 28;80(l-2): 63-9). 

2QP2H8 POLYNUCLEOTIDES 

One aspect of the invention provides polynucleotides corresponding or 
complementary to aU or part of a 20P2H8 gene, mRNA, and/or coding sequence, preferably 
in isolated form, including polynucleotides encoding a 20P2H8 protein and fragments 
thereof, DNA, RNA, DNA/RNA hybrid, and related molecules, polynucleotides or 
oligonucleotides complementary to a 20P2H8 gene or mRNA sequence or a part thereof, 
and polynucleotides or oligonucleotides that hybridize to a 20P2H8 gene, mRNA, or to a 
20P2H8 encoding polynucleotide (collectively, "20P2H8 polynucleotides"). As used herein, 
the 20P2H8 gene and protein is meant to include the 20P2H8 genes and proteins 
specifically described herein and the genes and proteins corresponding to other 20P2H8 
proteins and structurally similar variants of the foregoing. Such other 20P2H8 proteins and 
variants will generally have coding sequences that are highly homologous to the 20P2H8 
coding sequence, and preferably will share at least about 50% amino acid identity and at 
least about 60% amino acid homology (using BLAST criteria), more preferably sharing 70% 
or greater homology (using BLAST criteria). 

A 20P2H8 polynucleotide may comprise a polynucleotide having the nucleotide 
sequence of human 20P2H8 as shown in FIG. 1, wherein T can also be U; a polynucleotide 
which encodes all or part of the 20P2H8 protein; a sequence complementary to the 
foregoing; or a polynucleotide fragment of any of the foregoing. Another embodiment 
comprises a polynucleotide encoding a 20P2H8 polypeptide whose sequence is encoded by 
the cDNA contained in the plasmid as deposited with American Type Culture Collection 
10801 University Boulevard, Manassas, Virginia, USA (ATCC) as Accession No. 207151. 
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Another embodiment comprises a polynucleotide which is capable of hybridizing under 
stringent hybridization conditions to the human 20P2H8 cDNA shown in FIG. 1. 

Typical embodiments of the invention disclosed herein include 20P2H8 
polynucleotides containing specific portions of the 20P2H8 mRNA sequence (and those 
which are complementary to such sequences) such as those that encode the protein and 
fragments thereof. For example, representative embodiments of the invention disclosed 
herein include: polynucleotides encoding about amino acid 1 to about amino acid 10 of the 
20P2H8 protein shown in SEQ ID NO: 2, polynucleotides encoding about amino acid 20 to 
about amino acid 30 of the 20P2H8 protein shown in SEQ ID NO: 2, polynucleotides 
encoding about amino acid 30 to about amino acid 40 of the 20P2H8 protein shown in 
SEQ ID NO: 2, polynucleotides encoding about amino acid 40 to about amino acid 50 of 
the 20P2H8 protein shown in SEQ ID NO: 2, polynucleotides encoding about amino acid 
50 to about amino acid 60 of the 20P2H8 protein shown in SEQ ID NO: 2, polynucleotides 
encoding about amino acid 60 to about amino acid 70 of the 20P2H8 protein shown in 
SEQ ID NO: 2, polynucleotides encoding about amino acid 70 to about amino acid 80 of 
the 20P2H8 protein shown in SEQ ID NO: 2, polynucleotides encoding about amino acid 
80 to about amino acid 90 of the 20P2H8 protein shown in SEQ ID NO: 2 and 
polynucleotides encoding about amino acid 90 to about amino acid 100 of the 20P2H8 
protein shown in SEQ ID NO: 2, etc. Following this scheme, polynucleotides encoding 
portions of the amino acid sequence of amino acids 100-517 of the 20P2H8 protein (SEQ 
ID NO: 2) are typical embodiments of the invention. Polynucleotides encoding larger 
portions of the 20P2H8 protein are also contemplated. For example polynucleotides 
encoding from about amino acid 1 (or 20 or 30 or 40 etc.) to about amino acid 20, (or 30, or 
40 or 50 etc.) of the 20P2H8 protein shown in SEQ ID NO: 2 may be generated by a 
variety of techniques well known in the art. 

Additional illustrative embodiments of 20P2H8 polynucleotides include 
embodiments consisting of a polynucleotide having the sequence as shown in FIG. 1 (SEQ 
ID NO: 1) from about nucleotide residue number 1 through about nucleotide residue 
number 500, from about nucleotide residue number 500 through about nucleotide residue 
number 1000 and from about nucleotide residue number 1000 through about nucleotide 
residue number 3600. These polynucleotide fragments can include any portion of the 
20P2H8 sequence as shown in FIG. 1 (SEQ ID NO: 1), for example a polynucleotide 
having the 517 amino acid ORF within the polynucleotide sequence as shown in FIG. 1 
(SEQ ID NO: 1), e.g. from about nucleotide residue number 450 through about nucleotide 
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residue number 2000. Alternatively, a polynucleotide may include portions of both the 
coding and non-coding regions of the 20P2H8 protein such as a polynucleotide fragment 
consisting of the sequence from about residue 600 through about residue 3600 etc. 

Additional illustrative embodiments of the invention disclosed herein include 
20P2H8 polynucleotide fragments encoding one or more of the biological motifs contained 
within the 20P2H8 protein sequence. Typical polynucleotide fragments of the invention 
include those that encode one or more of the 20P2H8 N-glycosylation sites, casein kinase II 
phosphorylation sites, the RNA binding sequences such as the ribonucleoprotein-1 and 
ribonucleoprotein-2 consensus sites, the proline rich regions or N-myristoylation sites as 
disclosed in greater detail in the text discussing the 20P2H8 protein and polypeptides below. 

The polynucleotides of the preceding paragraphs have a number of different specific 
uses. For example, because the human 20P2H8 gene maps to chromosome 15q22.32-23, 
polynucleotides encoding different regions of the 20P2H8 protein can be used to 
characterize cytogenetic abnormalities on chromosome 15, bands ql3.2-ql4 that have been 
identified as being associated with various cancers. In particular, a variety of chromosomal 
abnormalities in 15q22.32-23 have been identified as frequent cytogenetic abnormalities in a 
number of different cancers (see, e.g., Goffman et al., Cancer Genet. Cytogenet. 1983 8(3): 
197-202; Yeatment et al., Clin. Exp. Metastasis 1996 14(3): 246-252)). Consequently, 
polynucleotides encoding specific regions of the 20P2H8 protein provide new tools that can 
be used to delineate with a greater precision than previously possible, the specific nature of 
the cytogenetic abnormalities in this region of chromosome 15 that may contribute to the 
malignant phenotype. In this context, these polynucleotides satisfy a need in the art for 
expanding the sensitivity of chromosomal screening in order to identify more subtle and less 
common chromosomal abnormalities (see, e.g., Evans et al, 1994, Am. J. Obstet. Gynecol. 
171(4):1055-1057). 

Alternatively, as 20P2H8 is shown to be highly expressed in prostate cancers (see 
e.g. FIG. 2), these polynucleotides may be used in methods assessing the status of 20P2H8 
gene products in normal versus cancerous tissues. Typically, polynucleotides encoding 
specific regions of the 20P2H8 protein may be used to assess the presence of perturbations 
(such as deletions, insertions, point mutations etc.) in specific regions (such as regions 
containing the RNA binding sequences) of the 20P2H8 gene products. Exemplary assays 
include both RT-PCR assays as well as single-strand conformation polymorphism (SSCP) 
analysis (see, e.g., Marrogi et al., 1999, J. Cutan. Pathol. 26(8): 369-378), both of which 
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utilize polynucleotides encoding specific regions of a protein to examine these regions 
within the protein. 

Other specifically contemplated embodiments of the invention disclosed herein are 
genomic DNA, cDNAs, ribozymes, and antisense molecules, as well as nucleic acid molecules 
5 based on an alternative backbone or including alternative bases, whether derived from natural 
sources or synthesized. For example, antisense molecules can be RNAs or other molecules, 
including peptide nucleic acids (PNAs) or non-nucleic acid molecules such as 
phosphorothioate derivatives, that specifically bind DNA or RNA in a base pair-dependent 
manner. A skilled artisan can readily obtain these classes of nucleic acid molecules using the 
10 20P2H8 polynucleotides and polynucleotide sequences disclosed herein. 

Antisense technology entails the administration of exogenous oligonucleotides that 
bind to a target polynucleotide located within the cells. The term "antisense" refers to the 
fact that such oligonucleotides are complementary to their intracellular targets, e.g., 20P2H8. 
See for example, Jack Cohen, 1988, OLIGODEOXYNUCLEOTIDES, Antisense 
15 Inhibitors of Gene Expression, CRC Press; and Synthesis 1:1-5 (1988). The 20P2H8 
antisense oligonucleotides of the present invention include derivatives such as S- 
oligonucleotides (phosphorothioate derivatives or S-oligos, see, Jack Cohen, supra), which 
exhibit enhanced cancer cell growth inhibitory action. S-oHgos (nucleoside 
phosphorothioates) are isoelectronic analogs of an oligonucleotide (O-oligo) in which a 
20 nonbridging oxygen atom of the phosphate group is replaced by a sulfur atom. The S-oligos 
of the present invention may be prepared by treatment of the corresponding O-oligos with 
3H-l,2-benzodithiol-3-one-l,l-dioxide, which is a sulfur transfer reagent. See Iyer, R. P. et 
al, 1990, J. Org. Chem. 55:4693-4698; and Iyer, R. P. et al., 1990, J. Am. Chem. Soc. 
112:1253-1254, the disclosures of which are fully incorporated by reference herein. 
25 Additional 20P2H8 antisense oligonucleotides of the present invention include morpholino 
antisense oligonucleotides known in the art (see e.g. Partridge et al., 1996, Antisense & 
Nucleic Acid Drug Development 6: 169-175). 

The 20P2H8 antisense oligonucleotides of the present invention typically may be 
RNA or DNA that is complementary to and stably hybridizes with the first 100 N-terminal 
codons or last 100 C-terminal codons of the 20P2H8 genomic sequence or the 
corresponding mRNA. While absolute complementarity is not required, high degrees of 
complementarity are preferred. Use of an oligonucleotide complementary to this region 
allows for the selective hybridization to 20P2H8 mRNA and not to mRNA specifying other 
regulatory subunits of protein kinase. Preferably, the 20P2H8 antisense oligonucleotides of 
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the present invention are a 15 to 30-mer fragment of the antisense DNA molecule having a 
sequence that hybridizes to 20P2H8 mRNA. Optionally, 20P2H8 antisense oligonucleotide 
is a 30-mer oligonucleotide that is complementary to a region in the first 10 N-terminal 
codons and last 10 C-terminal codons of 20P2H8. Alternatively, the antisense molecules are 

5 modified to employ ribozymes in the inhibition of 20P2H8 expression (L. A. Couture & D. 
T. Stinchcomb, 1996, Trends Genet. 12: 510-515). 

Further specific embodiments of this aspect of the invention include primers and 
primer pairs, which allow the specific amplification of the polynucleotides of the invention 
or of any specific parts thereof, and probes that selectively or specifically hybridize to 

10 nucleic acid molecules of the invention or to any part thereof. Probes may be labeled with a 
detectable marker, such as, for example, a radioisotope, fluorescent compound, 
bioluminescent compound, a chemiluminescent compound, metal chelator or enzyme. Such 
probes and primers can be used to detect the presence of a 20P2H8 polynucleotide in a 
sample and as a means for detecting a cell expressing a 20P2H8 protein. 

15 Examples of such probes include polypeptides comprising all or part of the human 

20P2H8 cDNA sequences shown in SEQ ID NO: 1. Examples of primer pairs capable of 
specifically amplifying 20P2H8 mRNAs are also described in the Examples that follow. As will 
be understood by the skilled artisan, a great many different primers and probes may be 
prepared based on the sequences provided herein and used effectively to amplify and/ or detect 

20 a 20P2H8 mRNA. 

As used herein, a polynucleotide is said to be "isolated" when it is substantially 
separated from contaminant polynucleotides that correspond or are complementary to genes 
other than the 20P2H8 gene or that encode polypeptides other than 20P2H8 gene product or 
fragments thereof. A skilled artisan can readily employ nucleic acid isolation procedures to 

25 obtain an isolated 20P2H8 polynucleotide. 

The 20P2H8 polynucleotides of the invention are useful for a variety of purposes, 
including but not limited to their use as probes and primers for the amplification and/ or 
detection of the 20P2H8 gene(s), mRNA(s), or fragments thereof; as reagents for the 
diagnosis and/or prognosis of prostate cancer and other cancers; as coding sequences 

30 capable of directing the expression of 20P2H8 polypeptides; as tools for modulating or 
inhibiting the expression of the 20P2H8 gene(s) and/or translation of the 20P2H8 
transcript(s); and as therapeutic agents. 
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ISOLATION OF 20P2H8-ENCODING NUCLEIC ACID MOLECULES 

The 20P2H8 cDNA sequences described herein enable the isolation of other 
polynucleotides encoding 20P2H8 gene produces), as well as the isolation of polynucleotides 
encoding 20P2H8 gene product homologs, alternatively spliced isoforms, allelic variants, and 
mutant forms of the 20P2H8 gene product Various molecular cloning methods that can be 
employed to isolate full length cDNAs encoding a 20P2H8 gene are well known (See, e.g., 
Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, 2d ed., Cold Spring 
Harbor Press, New York; Ausubel et al., eds., 1995, Current Protocols in Molecular Biology, 
Wiley and Sons). For example, lambda phage cloning methodologies may be conveniently 
employed, using commercially available cloning systems (e.g., Lambda ZAP Express, 
Stratagene). Phage clones containing 20P2H8 gene cDNAs may be identified by probing with 
a labeled 20P2H8 cDNA or a fragment thereof. For example, in one embodiment, the 
20P2H8 cDNA (SEQ ID NO: 1) or a portion thereof can be synthesized and used as a probe 
to retrieve overlapping and full length cDNAs corresponding to a 20P2H8 gene. The 20P2H8 
gene itself may be isolated by screening genomic DNA libraries, bacterial artificial chromosome 
libraries (BACs), yeast artificial chromosome libraries (YACs), and the like, with 20P2H8 DNA 
probes or primers. 

RECOMBINANT DNA MOLECULES AND HOST-VECTOR SYSTEMS 

The invention also provides recombinant DNA or RNA molecules containing a 
20P2H8 polynucleotide, including but not limited to phages, plasmids, phagemids, cosmids, 
YACs, BACs, as well as various viral and non-viral vectors well known in the art, and cells 
transformed or transfected with such recombinant DNA or RNA molecules. As used herein, a 
recombinant DNA or RNA molecule is a DNA or RNA molecule that has been subjected to 
molecular manipuktion in vitro. Methods for generating such molecules are well known (see, 
e.g., Sambrook et al, 1989, supra). 

The invention further provides a host-vector system comprising a recombinant DNA 
molecule containing a 20P2H8 polynucleotide within a suitable prokaryotic or eukaryotic 
host cell. Examples of suitable eukaryotic host cells include a yeast cell, a plant cell, or an 
animal cell, such as a mammalian cell or an insect cell (e.g., a baculovirus-infectible cell such 
as an Sf9 or HighFive cell). Examples of suitable mammalian cells include various prostate 
cancer cell lines such as PrEC, LNCaP and TsuPrl, other transferable or transducible 
prostate cancer cell lines, as well as a number of mammalian cells routinely used for the 
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expression of recombinant proteins (e.g., COS, CHO, 293, 293T cells). More particularly, a 
polynucleotide comprising the coding sequence of 20P2H8 may be used to generate 20P2H8 
proteins or fragments thereof using any number of host-vector systems routinely used and 
widely known in the art 

5 A wide range of host-vector systems suitable for the expression of 20P2H8 proteins or 

fragments thereof are available (see, e.g., Sambrook et al., 1989, supra; Current Protocols in 
Molecular Biology, 1995, supra). Preferred vectors for mammalian expression include but are 
not limited to pcDNA 3.1 myc-His-tag (Invitrogen) and the retroviral vector pSRatkneo 
(Muller et al., 1991, MCB 11:1785). Using these expression vectors, 20P2H8 may be 

10 preferably expressed in several prostate cancer and non-prostate cell lines, including for 
example 293, 293T, rat-1, NIH 3T3 and TsuPrl. The host-vector systems of the invention 
are useful for the production of a 20P2H8 protein or fragment thereof. Such host-vector 
systems may be employed to study the functional properties of 20P2H8 and 20P2H8 
mutations. 

15 Recombinant human 20P2H8 protein may be produced by mammalian cells 

transfected with a construct encoding 20P2H8. In an illustrative embodiment described in 
the Examples, 293T cells can be transfected with an expression plasmid encoding 20P2H8, 
the 20P2H8 protein is expressed in the 293T cells, and the recombinant 20P2H8 protein can 
be isolated using standard purification methods (e.g., affinity purification using anti-20P2H8 

20 antibodies). In another embodiment, also described in the Examples herein, the 20P2H8 
coding sequence is subcloned into the retroviral vector pSRaMSVtkneo and used to infect 
various mammalian cell lines, such as NIH 3T3, TsuPrl, 293 and rat-1 in order to establish 
20P2H8 expressing cell lines. Various other expression systems well known in the art may 
also be employed. Expression constructs encoding a leader peptide joined in frame to the 

25 20P2H8 coding sequence may be used for the generation of a secreted form of recombinant 
20P2H8 protein. 

Proteins encoded by the 20P2H8 genes, or by fragments thereof, will have a variety 
of uses, including but not limited to generating antibodies and in methods for identifying 
ligands and other agents and cellular constituents that bind to a 20P2H8 gene product. 
30 Antibodies raised against a 20P2H8 protein (like 20P2H8 polynucleotides) or fragment 
thereof may be useful in diagnostic and prognostic assays, and imaging methodologies in the 
management of human cancers characterized by expression of 20P2H8 protein, including 
but not limited to cancers of the kidney, skin, cervix, prostate, brain, bladder, pancreas, 
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ovaries, lung, testis and breast (see e.g. Figs. 4-8). Such antibodies may be expressed 
intracellularly and used in methods of treating patients with such cancers. Various 
immunological assays useful for the detection of 20P2H8 proteins are contemplated, including 
but not limited to various types of radioimmunoassays, enzyme-linked immunosorbent assays 
(ELISA), en2yme-linked immunofluorescent assays (ELIFA), immunocytochemical methods, 
and the like. Such antibodies may be labeled and used as immunological imaging reagents 
capable of detecting 20P2H8 expressing cells (eg., in radioscintigraphic imaging methods). 
20P2H8 proteins may also be particularly useful in generating cancer vaccines, as further 
described below. 

20P2H8 POLYPEPTIDES 

Another aspect of the present invention provides 20P2H8 proteins and polypeptide 
fragments thereof. The 20P2H8 proteins of the invention include those specifically identified 
herein, as well as allelic variants, conservative substitution variants and homologs that can be 
isolated/generated and characterized without undue experimentation following the methods 
outlined below. Fusion proteins that combine parts of different 20P2H8 proteins or fragments 
thereof, as well as fusion proteins of a 20P2H8 protein and a heterologous polypeptide are also 
included. Such 20P2H8 proteins will be collectively referred to as the 20P2H8 proteins, the 
proteins of the invention, or 20P2H8. As used herein, the term "20P2H8 polypeptide" refers 
to a polypeptide fragment or a 20P2H8 protein of at least 6 amino acids, preferably at least 15 
amino acids. 

Specific embodiments of 20P2H8 proteins comprise a polypeptide having the amino 
acid sequence of human 20P2H8 as shown in SEQ ID NO: 2. Alternatively, embodiments of 
20P2H8 proteins comprise variant polypeptides having alterations in the amino acid sequence 
of human 20P2H8 as shown in SEQ ID NO: 2. 

In general, naturally occurring allelic variants of human 20P2H8 will share a high 
degree of structural identity and homology (e.g., 90% or more identity). Typically, allelic 
variants of the 20P2H8 proteins will contain conservative amino acid substitutions within the 
20P2H8 sequences described herein or will contain a substitution of an amino acid from a 
corresponding position in a 20P2H8 homologue. One class of 20P2H8 allelic variants will be 
proteins that share a high degree of homology with at least a small region of a particular 
20P2H8 amino acid sequence, but will further contain a radical departure from the sequence, 
such as a non-conservative substitution, truncation, insertion or frame shift. 
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Conservative amino acid substitutions can frequently be made in a protein without 
altering either the conformation or the function of the protein. Such changes include 
substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these 
hydrophobic amino acids; aspartic acid (D) for glutamic acid (E) and vice versa; glutamine 

5 (Q) for asparagine (N) and vice versa; and serine (S) for threonine (T) and vice versa. Other 
substitutions can also be considered conservative, depending on the environment of the 
particular amino acid and its role in the three-dimensional structure of the protein. For 
example, glycine (G) and alanine (A) can frequendy be interchangeable, as can alanine (A) 
and valine (V). Methionine (M), which is relatively hydrophobic, can frequendy be 

10 interchanged with leucine and isoleucine, and sometimes with valine. Lysine (K) and 
arginine (R) are frequendy interchangeable in locations in which the significant feature of 
the amino acid residue is its charge and the differing pK's of these two amino acid residues 
are not significant. Still other changes can be considered "conservative" in particular 
environments. 

15 Embodiments of the invention disclosed herein include a wide variety of art 

accepted variants of 20P2H8 proteins such as polypeptides having amino acid insertions, 
deletions and substitutions. 20P2H8 variants can be made using methods known in the art 
such as site-directed mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed 
mutagenesis (Carter et aL, 1986, Nucl. Acids Res. 13:4331; Zoller et al., 1987, Nucl. Acids 

20 Res. 10:6487), cassette mutagenesis (Wells et al., 1985, Gene 34:315), restriction selection 
mutagenesis (Wells et al., 1986, Philos. Trans. R. Soc. London Ser. A, 317:415) or other 
known techniques can be performed on the cloned DNA to produce the 20P2H8 variant 
DNA. Scanning amino acid analysis can also be employed to identify one or more amino 
acids along a contiguous sequence. Among the preferred scanning amino acids are relatively 

25 small, neutral amino acids. Such amino acids include alanine, glycine, serine, and cysteine. 
Alanine is typically a preferred scanning amino acid among this group because it eliminates 
the side-chain beyond the beta-carbon and is less likely to alter the main-chain conformation 
of the variant. Alanine is also typically preferred because it is the most common amino acid. 
Further, it is frequendy found in both buried and exposed positions (Creighton, The 

30 Proteins, (W.H. Freeman & Co., N.Y.); Chothia, 1976, J. Mol. Biol., 150:1). If alanine 
substitution does not yield adequate amounts of variant, an isosteric amino acid can be used. 

As defined herein, 20P2H8 variants have the distinguishing attribute of having at 
least one epitope in common with a 20P2H8 protein having the amino acid sequence of 
SEQ ID NO: 2, such that an antibody that specifically binds to a 20P2H8 variant will also 
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specifically bind to the 20P2H8 protein having the amino acid sequence of SEQ ID NO: 2. 
A polypeptide ceases to be a variant of the protein shown in SEQ ID NO: 2 when it no 
longer contains an epitope capable of being recognized by an antibody that specifically binds 
to a 20P2H8 protein. Those skilled in the art understand that antibodies that recognize 
5 proteins bind to epitopes of varying size, and a grouping of the order of about six amino 
acids, contiguous or not, is regarded as a typical number of amino acids in a minimal 
epitope. See e.g. Hebbes et al., Mol Immunol (1989) 26(9):865-73; Schwartz et aL, J 
Immunol (1985) 135(4):2598-608. As there are approximately 20 amino acids that can be 
included at a given position within the minimal 6 amino acid epitope, an approximation of 
10 the odds of such an epitope occurring by chance are about 20 6 or about 1 in 64 million. 
Another specific class of 20P2H8 protein variants shares 90% or more identity with the 
amino acid sequence of SEQ ID NO: 2. Another specific class of 20P2H8 protein variants 
comprises one or more of the 20P2H8 biological motifs described below. 

As discussed above, embodiments of the claimed invention include polypeptides 
15 containing less than the 517 amino acid sequence of the 20P2H8 protein shown in SEQ ID 
NO: 2 (and the polynucleotides encoding such polypeptides). For example, representative 
embodiments of the invention disclosed herein include polypeptides consisting of about 
amino acid 1 to about amino acid 10 of the 20P2H8 protein shown in SEQ ID NO: 2, 
polypeptides consisting of about amino acid 20 to about amino acid 30 of the 20P2H8 
20 protein shown in SEQ ID NO: 2, polypeptides consisting of about amino acid 30 to about 
amino acid 40 of the 20P2H8 protein shown in SEQ ID NO: 2, polypeptides consisting of 
about amino acid 40 to about amino acid 50 of the 20P2H8 protein shown in SEQ ID NO: 
2, polypeptides consisting of about amino acid 50 to about amino acid 60 of the 20P2H8 
protein shown in SEQ ID NO: 2, polypeptides consisting of about amino acid 60 to about 
25 amino acid 70 of the 20P2H8 protein shown in SEQ ID NO: 2, polypeptides consisting of 
about amino acid 70 to about amino acid 80 of the 20P2H8 protein shown in SEQ ID NO: 
2, polypeptides consisting of about amino acid 80 to about amino acid 90 of the 20P2H8 
protein shown in SEQ ID NO: 2 and polypeptides consisting of about amino acid 90 to 
about amino acid 100 of the 20P2H8 protein shown in SEQ ID NO: 2, etc. Following this 
30 scheme, polypeptides consisting of portions of the amino acid sequence of amino acids 100- 
517 of the 20P2H8 protein are typical embodiments of the invention. Polypeptides 
consisting of larger portions of the 20P2H8 protein are also contemplated. For example 
polypeptides consisting of about amino acid 1 (or 20 or 30 or 40 etc.) to about amino acid 
20, (or 30, or 40 or 50 etc.) of the 20P2H8 protein shown in SEQ ID NO: 2 may be 
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generated by a variety of techniques well known in the art. 

Additional illustrative embodiments of the invention disclosed herein include 
20P2H8 polypeptides containing the amino acid residues of one or more of the biological 
motifs contained within the 20P2H8 polypeptide sequence as shown in SEQ ID NO: 2. In 

5 one embodiment, typical polypeptides of the invention can contain one or more of the 
20P2H8 N-glycosylation sites such as NYTA at residues 436-439 and/or NLSG at residues 
459-462 (SEQ ID NO: 2). In another embodiment, typical polypeptides of the invention 
can contain one or more of the 20P2H8 casein kinase II phosphorylation sites such as 
SQVE at residues 19-22, SDPE at residues 41-44, SKME at residues 56-59, SDQD at 

10 residues 77-80, TGED at residues 141-144, TSNE at residues 152-155, TAEE at residues 
178-181, TYPD at residues 203-206, TAAE at residues 245-248, TIED at residues 297-300 
and/or SAEE at residues 364-367 (SEQ ID NO: 2). In another embodiment, typical 
polypeptides of the invention can contain one or more of the N-myristoylation sites such as 
GLNIAK at residues 87-92, GAALCL at residues 94-99, GGTSNE at residues 150-155, 

15 GLPFTA at residues 172-177, GGKEGI at residues 194-199, GLPYAA at residues 291- 
296, GGTLNR at residues 374-379, GSPNSL at residues 446-451, GLAYNT at residues 
475-480 and/or GLIHTN at residues 499-504 (SEQ ID NO: 2). In another embodiment, 
typical polypeptides of the invention can contain the one or more of the amidation sites 
such as QGRR at residues 102-105 and/or LGKR at residues 234-237 (SEQ ID NO: 2). In 

20 another embodiment, typical polypeptides of the invention can contain one or more of the 
RNA-binding domains such as those shown in Figure 1. In another embodiment, typical 
polypeptides of the invention can contain one or more of the proline rich regions such as 
those shown in Figure 1. In another embodiment, typical polypeptides of the invention can 
contain one or more of the immunoreactive epitopes identified by a process described 

25 herein such as such as those shown in Table 1. Related embodiments of these inventions 
include polypeptides containing combinations of the different motifs discussed above with 
preferable embodiments being those which contain no insertions, deletions or substitutions 
either within the motifs or the intervening sequences of these polypeptides. In addition, 
embodiments which include a number of either N-terminal and/or C-terminal amino acid 

30 residues on either side of these motifs may be desirable (to, for example, include a greater 
portion of the polypeptide architecture in which the motif is located). Typically the number 
of N-terminal and/or C-terminal amino acid residues on either side of a motif is between 
about 5 to about 50 amino acid residues. 

Illustrative examples of such embodiments includes a polypeptide having one or 
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more motifs selected from the group consisting of SDPE and/or SKME and/ or TGED 
(SEQ ID NO: 2). Alternatively polypeptides having other combinations of the biological 
motifs disclosed herein are also contemplated such as a polypeptide having GLIHN and any 
one of the other biological motifs such as SKME or a polypeptide having GLIHN and any 
5 one of the other biological motifs such as GGKEGI etc. (SEQ ID NO: 2). 

Polypeptides consisting of one or more of the 20P2H8 motifs discussed above are 
useful in elucidating the specific characteristics of a malignant phenotype in view of the 
observation that the 20P2H8 motifs discussed above are associated with growth 
dysregulation and because 20P2H8 is overexpressed in cancers (FIGS. 4 and 5). Casein 

10 kinase II and protein kinase C for example are enzymes known to be associated with the 
development of the malignant phenotype (see e.g. Chen et aL, 1998, Lab Invest, 78(2):165- 
174; Gaiddon et al., 1995, Endocrinology 136(1 0):4331-4338; Hall et aL, 1996, Nucleic Acids 
Research 24(6):1119-1126; Peterziel et aL, 1999, Oncogene 18(46):6322-6329; and O'Bnan, 
1998, Oncol. Rep. 5(2): 305-309). Moreover, both glycosylation and myristoylation are 

15 protein modifications also associated with cancer and cancer progression (see e.g. Dennis et 
aL, 1999, Biochim. Biophys. Acta 1473(l):21-34; Raju et aL, 1997, Exp. Cell Res. 235(1):145- 
154). In addition, the RNA recognition motifs and proline rich regions are also associated 
with oncogenic processes (see e.g. Kennedy et al. Nat Genet. 12(3): 329-331 (1996): Li et 
aL, J. Biol. Chem. 275(30): 23053-23058 (2000) and Xiao et aL, Blood. 2000 Jul 15;96(2):699- 

20 704). 

The polypeptides of the preceding paragraphs have a number of different specific 
uses. As 20P2H8 is shown to be expressed in a variety of cancers including kidney, 
prostate, bladder, testicular, ovarian, breast, pancreas, colon, skin, cervical, stomach and lung 
cancer cell lines and/or patient samples (see e.g. FIGS. 4-8), these polypeptides may be used 

25 in methods assessing the status of 20P2H8 gene products in normal versus cancerous tissues 
and elucidating the malignant phenotype. Typically, polypeptides encoding specific regions 
of the 20P2H8 protein may be used to assess the presence of perturbations (such as 
deletions, insertions, point mutations etc.) in specific regions of the 20P2H8 gene products 
(such as regions containing the RNA binding motifs). Exemplary assays can utilize 

30 antibodies targeting a 20P2H8 polypeptide containing the amino acid residues of one or 
more of the biological motifs contained within the 20P2H8 polypeptide sequence in order 
to evaluate the characteristics of this region in normal versus cancerous tissues. 
Alternatively, 20P2H8 polypeptides containing the amino acid residues of one or more of 
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the biological motifs contained within the 20P2H8 polypeptide sequence can be used to 
screen for factors that interact with that region of 20P2H8. 

As discussed above, redundancy in the genetic code permits variation in 20P2H8 
gene sequences. In particular, one skilled in the art will recognize specific codon 
preferences by a specific host species and can adapt the disclosed sequence as preferred for 
a desired host For example, preferred codon sequences typically have rare codons (i.e. ? 
codons having a usage frequency of less than about 20% in known sequences of the desired 
host) replaced with higher frequency codons. Codon preferences for a specific organism 
may be calculated, for example, by utilizing codon usage tables available on the Internet at 
the following address: http://www.dna.affrc.go.)p/^nakamura/codon.html. Nucleotide 
sequences that have been optimized for a particular host species by replacing any codons 
having a usage frequency of less than about 20% are referred to herein as "codon optimized 
sequences." 

Additional sequence modifications are known to enhance protein expression in a 
cellular host. These include elimination of sequences encoding spurious polyadenylation 
signals, exon/intron splice site signals, transposon-like repeats, and/ or other such well- 
characterized sequences that may be deleterious to gene expression. The GC content of the 
sequence may be adjusted to levels average for a given cellular host, as calculated by 
reference to known genes expressed in the host cell. Where possible, the sequence may also 
be modified to avoid predicted hairpin secondary mRNA structures. Other useful 
modifications include the addition of a translational initiation consensus sequence at the 
start of the open reading frame, as described in Kozak, 1989, Mol. Cell Biol., 9:5073-5080. 
Nucleotide sequences that have been optimized for expression in a given host species by 
elimination of spurious polyadenylation sequences, elimination of exon/intron splicing 
signals, elimination of transposon-like repeats and/or optimization of GC content in 
addition to codon optimization are referred to herein as an "expression enhanced 
sequence." 

20P2H8 proteins may be embodied in many forms, preferably in isolated form. As 
used herein, a protein is said to be "isolated" when physical, mechanical or chemical methods 
are employed to remove the 20P2H8 protein from cellular constituents that are normally 
associated with the protein. A skilled artisan can readily employ standard purification methods 
to obtain an isolated 20P2H8 protein. A purified 20P2H8 protein molecule will be 
substantially free of other proteins or molecules that impair the binding of 20P2H8 to 
antibody or other ligand. The nature and degree of isolation and purification will depend on 
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the intended use. Embodiments of a 20P2H8 protein include a purified 20P2H8 protein and 
a functional, soluble 20P2H8 protein. In one form, such functional, soluble 20P2H8 
proteins or fragments thereof retain the ability to bind antibody or other ligand. 

The invention also provides 20P2H8 polypeptides comprising biologically active 
fragments of the 20P2H8 amino acid sequence, such as a polypeptide corresponding to part 
of the amino acid sequence for 20P2H8 as shown in SEQ ID NO: 2. Such polypeptides of 
the invention exhibit properties of the 20P2H8 protein, such as the ability to elicit the 
generation of antibodies that specifically bind an epitope associated with the 20P2H8 
protein. 

20P2H8 polypeptides can be generated using standard peptide synthesis technology or 
using chemical cleavage methods well known in the art based on the amino acid sequences of 
the human 20P2H8 proteins disclosed herein. Alternatively, recombinant methods can be used 
to generate nucleic acid molecules that encode a polypeptide fragment of a 20P2H8 protein. In 
this regard, the 20P2H8-encoding nucleic acid molecules described herein provide means for 
generating defined fragments of 20P2H8 proteins. 20P2H8 polypeptides are particularly useful 
in generating and characterizing domain specific antibodies (e.g., antibodies recognizing an 
extracellular or intracellular epitope of a 20P2H8 protein), in identifying agents or cellular 
factors that bind to 20P2H8 or a particular structural domain thereof, and in various 
therapeutic contexts, including but not limited to cancer vaccines. 

20P2H8 polypeptides containing particularly interesting structures can be predicted 
and/or identified using various analytical techniques well known in the art, including, for 
example, the methods of Chou-Fasman, Garnier-Robson, Kyte-Doolittle, Eisenberg, Karplus- 
Schultz or Jameson-Wolf analysis, or on the basis of immunogenicity. Fragments containing 
such structures are particularly useful in generating subunit specific anti-20P2H8 antibodies or 
in identifying cellular factors that bind to 20P2H8. 

In an embodiment described in the examples that follow, 20P2H8 can be 
conveniently expressed in cells (such as 293T cells) transfected with a commercially available 
expression vector such as a CMV-driven expression vector encoding 20P2H8 with a C- 
terminal 6XFIis and MYC tag (pcDNA3.1/mycHIS, Invitrogen or Tag5, GenHunter 
Corporation, Nashville TN). The Tag5 vector provides an IgGK secretion signal that can 
be used to facilitate the production of a secreted 20P2H8 protein in transfected cells. The 
secreted HIS-tagged 20P2H8 in the culture media may be purified using a nickel column 
using standard techniques. 
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Modifications of 20P2H8 such as covalent modifications are included within the 
scope of this invention. One type of covalent modification includes reacting targeted amino 
acid residues of a 20P2H8 polypeptide with an organic derivatizing agent that is capable of 
reacting with selected side chains or the N- or C- terminal residues of the 20P2H8. Another 

5 type of covalent modification of the 20P2H8 polypeptide included within the scope of this 
invention comprises altering the native glycosylation pattern of the polypeptide. "Altering 
the native glycosylation pattern 11 is intended for purposes herein to mean deleting one or 
more carbohydrate moieties found in native sequence 20P2H8 (either by removing the 
underlying glycosylation site or by deleting the glycosylation by chemical and/ or enzymatic 

10 means), and/ or adding one or more glycosylation sites that are not present in the native 
sequence 20P2H8. In addition, the phrase includes qualitative changes in the glycosylation 
of the native proteins, involving a change in the nature and proportions of the various 
carbohydrate moieties present. Another type of covalent modification of 20P2H8 
comprises linking the 20P2H8 polypeptide to one of a variety of nonproteinaceous 

15 polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in the 
manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 
or 4,179,337. 

The 20P2H8 of the present invention may also be modified in a way to form a 
chimeric molecule comprising 20P2H8 fused to another, heterologous polypeptide or amino 

20 acid sequence. In one embodiment, such a chimeric molecule comprises a fusion of the 
20P2H8 with a polyhistidine epitope tag, which provides an epitope to which immobilized 
nickel can selectively bind. The epitope tag is generally placed at the amino- or carboxyl- 
terminus of the 20P2H8. In an alternative embodiment, the chimeric molecule may 
comprise a fusion of the 20P2H8 with an immunoglobulin or a particular region of an 

25 immunoglobulin. For a bivalent form of the chimeric molecule (also referred to as an 
"immunoadhesin"), such a fusion could be to the Fc region of an IgG molecule. The Ig 
fusions preferably include the substitution of a soluble (transmembrane domain deleted or 
inactivated) form of a 20P2H8 polypeptide in place of at least one variable region within an 
Ig molecule. In a particularly preferred embodiment, the immunoglobulin fusion includes 

30 the hinge, CH2 and CH3, or the hinge, CHI, CH2 and CH3 regions of an IgGl molecule. 
For the production of immunoglobulin fusions see also US Patent No. 5,428,130 issued 
June 27,1995. 
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2QP2H8 ANTIBODIES 

The term "antibody" is used in the broadest sense and specifically covers single anti- 
20P2H8 monoclonal antibodies (including agonist, antagonist and neutralizing antibodies) and 
anti-20P2H8 antibody compositions with polyepitopic specificity. The term "monoclonal 
antibody" (mAb) as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e. the antibodies comprising the individual population 
are identical except for possible naturally-occurring mutations that may be present in minor 
amounts. 

Another aspect of the invention provides antibodies that bind to 20P2H8 proteins and 
polypeptides. The most preferred antibodies will specifically bind to a 20P2H8 protein and will 
not bind (or will bind weakly) to non-20P2H8 proteins and polypeptides. Anti-20P2H8 
antibodies that are particularly contemplated include monoclonal and polyclonal antibodies as 
well as fragments containing the antigen binding domain and/or one or more complementarity 
determining regions of these antibodies. As used herein, an antibody fragment is defined as 
at least a portion of the variable region of the immunoglobulin molecule that binds to its 
target, i.e., the antigen binding region. 

20P2H8 antibodies of the invention may be particularly useful in prostate cancer 
diagnostic and prognostic assays, and imaging methodologies. Intracellularly expressed 
antibodies (e.g., single chain antibodies) may be therapeutically useful in treating cancers in 
which the expression of 20P2H8 is involved, such as for example advanced and metastatic 
prostate cancers. Such antibodies may be useful in the treatment, diagnosis, and/ or 
prognosis of other cancers, to the extent 20P2H8 is also expressed or overexpressed in 
other types of cancers such as prostate, kidney, bladder, cervical, skin, stomach, testicular, 
ovarian, breast, pancreas, colon and lung cancers. 

The invention also provides various immunological assays useful for the detection and 
quantification of 20P2H8 and mutant 20P2H8 proteins and polypeptides. Such assays generally 
comprise one or more 20P2H8 antibodies capable of recognizing and binding a 20P2H8 or 
mutant 20P2H8 protein, as appropriate, and may be performed within various immunological 
assay formats well known in the art, including but not limited to various types of 
radioimmunoassays, enzyme-linked immunosorbent assays (ELISA), enzyme-linked 
immunofluorescent assays (ELIFA), and the like. In addition, immunological imaging methods 
capable of detecting prostate cancer and other cancers expressing 20P2H8 are also provided by 
the invention, including but limited to radioscintigraphic imaging methods using labeled 



31 



20P2H8 antibodies. Such assays may be clinically useful in the detection, monitoring, and 
prognosis of 20P2H8 expressing cancers such as prostate, bladder, testicular, ovarian, breast, 
pancreas, colon and lung cancer cell lines. 

20P2H8 antibodies may also be used in methods for purifying 20P2H8 and mutant 
5 20P2H8 proteins and polypeptides and for isolating 20P2H8 homologues and related 
molecules. For example, in one embodiment, the method of purifying a 20P2H8 protein 
comprises incubating a 20P2H8 antibody, which has been coupled to a solid matrix, with a 
lysate or other solution containing 20P2H8 under conditions that permit the 20P2H8 antibody 
to bind to 20P2H8; washing the solid matrix to eliminate impurities; and eluting the 20P2H8 

10 from the coupled antibody. Other uses of the 20P2H8 antibodies of the invention include 
generating anti-idiotypic antibodies that mimic the 20P2H8 protein. 

Various methods for the preparation of antibodies are well known in the art. For 
example, antibodies may be prepared by immunizing a suitable mammalian host using a 
20P2H8 protein, peptide, or fragment, in isolated or immunoconjugated form (Harlow, and 

15 Lane, eds., 1988, Antibodies: A Laboratory Manual, CSH Press; Harlow, 1989, Antibodies, 
Cold Spring Harbor Press, NY). In addition, fusion proteins of 20P2H8 may also be used, 
such as a 20P2H8 GST-fusion protein. In a particular embodiment, a GST fusion protein 
comprising all or most of the open reading frame amino acid sequence of SEQ ID NO: 2 may 
be produced and used as an immunogen to generate appropriate antibodies. In another 

20 embodiment, a 20P2H8 peptide may be synthesized and used as an immunogen. 

In addition, naked DNA immunization techniques known in the art may be used (with 
or without purified 20P2H8 protein or 20P2H8 expressing cells) to generate an immune 
response to the encoded immunogen (for review, see Donnelly et al. ? 1997, Ann. Rev. 
Immunol. 15:617-648). 

25 The amino acid sequence of the 20P2H8 as shown in SEQ ID NO: 2 may be used to 

select specific regions of the 20P2H8 protein for generating antibodies. For example, 
hydrophobicity and hydrophilicity analyses of the 20P2H8 amino acid sequence may be used to 
identify hydrophilic regions in the 20P2H8 structure. Regions of the 20P2H8 protein that 
show immunogenic structure, as well as other regions and domains, can readily be identified 

30 using various other methods known in the art, such as Chou-Fasman, Garnier-Robson, Kyte- 
Doolittle, Eisenberg, Karplus-Schultz or Jameson-Wolf analysis. 

Illustrating this, the binding of peptides from 20P2H8 proteins to the human MHC 
class I molecule HLA-A2 are predicted and shown in Table 1 below. Specifically, the 
complete amino acid sequences of 20P2H8 proteins was entered into the HLA Peptide 
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Motif Search algorithm found in the Bioinformatics and Molecular Analysis Section 
(BIMAS) Web site (http://bimas.dcrt.nih.gov/). The HLA Peptide Motif Search algorithm 
was developed by Dr. Ken Parker based on binding of specific peptide sequences in the 
groove of HLA Class I molecules and specifically HLA-A2 (see e.g. Falk et aL, Nature 351: 
290-6 (1991); Hunt et aL, Science 255:1261-3 (1992); Parker et aL, J. Immunol. 149:3580-7 
(1992); Parker et aL, J. Immunol. 152:163-75 (1994)). This algorithm allows location and 
ranking of 8-mer, 9-mer, and 10-mer peptides from a complete protein sequence for 
predicted binding to HLA-A2 as well as other HLA Class I molecules. Most HLA-A2 
binding peptides are 9-mers favorably containing a leucine (L) at position 2 and a valine (V) 
or leucine (L) at position 9 (Parker et aL, J. Immunol. 149:3580-7 (1992)). The results of 
20P2H8 predicted binding peptides are shown in Table 1 below. In Table 1, the top 10 
ranking candidates for each family member are shown along with their location, the amino 
acid sequence of each specific peptide, and an estimated binding score. The binding score 
corresponds to the estimated half-time of dissociation of complexes containing the peptide 
at 37°C at pH 6.5. Peptides with the highest binding score are predicted to be the most 
tightly bound to HLA Class I on the cell surface and thus represent the best immunogenic 
targets for T-cell recognition. Actual binding of peptides to HLA-A2 can be evaluated by 
stabilization of HLA-A2 expression on the antigen-processing defective cell line T2 (see e.g. 
Xue et aL, Prostate 30:73-8 (1997) and Peshwa et aL, Prostate 36:129-38 (1998)). 
Immunogenicity of specific peptides can be evaluated in vitro by stimulation of CD8+ 
cytotoxic T lymphocytes (CTL) in the presence of dendritic cells. 

Methods for preparing a protein or polypeptide for use as an immunogen and for 
preparing immunogenic conjugates of a protein with a carrier such as BSA, KLH, or other 
carrier proteins are well known in the art. In some circumstances, direct conjugation using, for 
example, carbodiimide reagents may be used; in other instances linking reagents such as those 
supplied by Pierce Chemical Co., Rockford, IL, may be effective. Administration of a 20P2H8 
immunogen is conducted generally by injection over a suitable time period and with use of a 
suitable adjuvant, as is generally understood in the art. During the immunization schedule, 
titers of antibodies can be taken to determine adequacy of antibody formation. 

20P2H8 monoclonal antibodies are preferred and may be produced by various means 
well known in the art. For example, immortalized cell lines that secrete a desired monoclonal 
antibody may be prepared using the standard hybridoma technology of Kohler and Milstein or 
modifications that immortalize producing B cells, as is generally known. The immortalized cell 
lines secreting the desired antibodies are screened by immunoassay in which the antigen is the 
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20P2H8 protein or a 20P2H8 fragment. When the appropriate immortalized cell culture 
secreting the desired antibody is identified, the cells may be expanded and antibodies produced 
either from in vitro cultures or from ascites fluid. 

The antibodies or fragments may also be produced, using current technology, by 
5 recombinant means. Regions that bind specifically to the desired regions of the 20P2H8 
protein can also be produced in the context of chimeric or CDR grafted antibodies of multiple 
species origin. Humanized or human 20P2H8 antibodies may also be produced and are 
preferred for use in therapeutic contexts. Methods for humanizing murine and other non- 
human antibodies by substituting one or more of the non-human antibody CDRs for 

10 corresponding human antibody sequences are well known (see for example, Jones et al., 1986, 
Nature 321:522-525; Riechmann et al., 1988, Nature 332:323-327; Verhoeyen et aL, 1988, 
Science 239:1534-1536). See also, Carter et al., 1993, Proc. Natl. Acad. Sci. USA 89:4285 and 
Sims et al., 1993, J. Immunol. 151:2296. Methods for producing fully human monoclonal 
antibodies include phage display and transgenic methods (for review, see Vaughan et al., 1998, 

15 Nature Biotechnology 16:535-539). 

Fully human 20P2H8 monoclonal antibodies may be generated using cloning 
technologies employing large human Ig gene combinatorial libraries (i.e., phage display) 
(Griffiths and Hoogenboom, Building an in vitro immune system: human antibodies from 
phage display libraries. In: Clark, M., ed., 1993, Protein Engineering of Antibody Molecules for 

20 Prophylactic and Therapeutic Applications in Man, Nottingham Academic, pp 45-64; Burton 
and Barbas, Human Antibodies from combinatorial libraries. Id., pp 65-82). Fully human 
20P2H8 monoclonal antibodies may also be produced using transgenic mice engineered to 
contain human immunoglobulin gene loci as described in PCT Patent Application 
W098/24893, Kucherlapati and Jakobovits et al., published December 3, 1997 (see also, 

25 Jakobovits, 1998, Exp. Opin. Invest. Drugs 7(4):607-614). This method avoids the in vitro 
manipulation required with phage display technology and efficiently produces high affinity 
authentic human antibodies. 

Reactivity of 20P2H8 antibodies with a 20P2H8 protein may be established by a 
number of well known means, including western blot, immunoprecipitation, ELISA, and 

30 FACS analyses using, as appropriate, 20P2H8 proteins, peptides, 20P2H8-expressing cells or 
extracts thereof. 

A 20P2H8 antibody or fragment thereof of the invention may be labeled with a 
detectable marker or conjugated to a second molecule. Suitable detectable markers include, 
but are not limited to, a radioisotope, a fluorescent compound, a bioluminescent compound, 
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chemiluminescent compound, a metal chelator or an enzyme. A second molecule for 
conjugation to the 20P2H8 antibody can be selected in accordance with the intended use. 
For example, for therapeutic use, the second molecule can be a toxin or therapeutic agent. 
Further, bi-specific antibodies specific for two or more 20P2H8 epitopes may be generated 
using methods generally known in the art. Homodimeric antibodies may also be generated by 
cross-linking techniques known in the art (e.g., Wolff et aL, 1993, Cancer Res. 53: 2560-2565). 

20P2H8 TRANSGENIC ANIMALS 

Nucleic acids that encode 20P2H8 or its modified forms can also be used to 
generate either transgenic animals or "knock out" animals which, in turn, are useful in the 
development and screening of therapeutically useful reagents. A transgenic animal (e.g., a 
mouse or rat) is an animal having cells that contain a transgene, which transgene was 
introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic 
stage. A transgene is a DNA that is integrated into the genome of a cell from which a 
transgenic animal develops. In one embodiment, cDNA encoding 20P2H8 can be used to 
clone genomic DNA encoding 20P2H8 in accordance with established techniques and the 
genomic sequences used to generate transgenic animals that contain cells that express DNA 
encoding 20P2H8. Methods for generating transgenic animals, particularly animals such as 
mice or rats, have become conventional in the art and are described, for example, in U.S. 
Patent Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for 
20P2H8 transgene incorporation with tissue-specific enhancers. Transgenic animals that 
include a copy of a transgene encoding 20P2H8 introduced into the germ line of the animal 
at an embryonic stage can be used to examine the effect of increased expression of DNA 
encoding 20P2H8. Such animals can be used as tester animals for reagents thought to 
confer protection from, for example, pathological conditions associated with its 
overexpression. In accordance with this facet of the invention, an animal is treated with the 
reagent and a reduced incidence of the pathological condition, compared to untreated 
animals bearing the transgene, would indicate a potential therapeutic intervention for the 
pathological condition. 

Alternatively, non-human homologues of 20P2H8 can be used to construct a 
20P2H8 "knock out" animal that has a defective or altered gene encoding 20P2H8 as a 
result of homologous recombination between the endogenous gene encoding 20P2H8 and 
altered genomic DNA encoding 20P2H8 introduced into an embryonic cell of the animal. 
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For example, cDNA encoding 20P2H8 can be used to clone genomic DNA encoding 
20P2H8 in accordance with established techniques. A portion of the genomic DNA 
encoding 20P2H8 can be deleted or replaced with another gene, such as a gene encoding a 
selectable marker that can be used to monitor integration. Typically, several kilobases of 
5 unaltered flanking DNA (both at the 5 f and 3' ends) are included in the vector (see e.g., 
Thomas and Capecchi, 1987, Cell 51:503) for a description of homologous recombination 
vectors]. The vector is introduced into an embryonic stem cell line (e.g., by electroporation) 
and cells in which the introduced DNA has homologously recombined with the endogenous 
DNA are selected (see e.g., Li et al., 1992, Cell 69:915). The selected cells are then injected 

10 into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation chimeras (see e.g., 
Bradley, in Robertson, ed., 1987, Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, (IRL, Oxford), pp. 113-152). A chimeric embryo can then be implanted into a 
suitable pseudopregnant female foster animal and the embryo brought to term to create a 
"knock out" animal. Progeny harboring the homologously recombined DNA in their germ 

15 cells can be identified by standard techniques and used to breed animals in which all cells of 
the animal contain the homologously recombined DNA. Knockout animals can be 
characterized for instance, for their ability to defend against certain pathological conditions 
and for their development of pathological conditions due to absence of the 20P2H8 
polypeptide. 

20 

METHODS FOR THE DETECTION OF 20P2H8 

Another aspect of the present invention relates to methods for detecting 20P2H8 
polynucleotides and 20P2H8 proteins and variants thereof, as well as methods for identifying a 
cell that expresses 20P2H8. The expression profile of 20P2H8 makes it a potential 

25 diagnostic marker for local and/or metastasized disease. Northern blot analysis suggests that 
different tissues express different isoforms of 20P2H8. The 20P2H8 isoforms in prostate 
cancer appear to be different from the isoform expressed in normal prostate. In this context, 
the status of 20P2H8 gene products may provide information useful for predicting a variety of 
factors including susceptibility to advanced stage disease, rate of progression, and/ or tumor 

30 aggressiveness. As discussed in detail below, the status of 20P2H8 gene products in patient 
samples may be analyzed by a variety protocols that are well known in the art including 
immunohistochemical analysis, the variety of Northern blotting techniques including in situ 
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hybridization, RT-PCR analysis (for example on laser capture micro-dissected samples), 
western blot analysis and tissue array analysis. 

More particularly, the invention provides assays for the detection of 20P2H8 
polynucleotides in a biological sample, such as serum, bone, prostate, and other tissues, urine, 
semen, cell preparations, and the like. Detectable 20P2H8 polynucleotides include, for 
example, a 20P2H8 gene or fragments thereof, 20P2H8 mRNA, alternative splice variant 
20P2H8 mRNAs, and recombinant DNA or RNA molecules containing a 20P2H8 
polynucleotide. A number of methods for amplifying and/or detecting the presence of 
20P2H8 polynucleotides are well known in the art and may be employed in the practice of this 
aspect of the invention. 

In one embodiment, a method for detecting a 20P2H8 mRNA in a biological sample 
comprises producing cDNA from the sample by reverse transcription using at least one 
primer; amplifying the cDNA so produced using a 20P2H8 polynucleotides as sense and 
antisense primers to amplify 20P2H8 cDNAs therein; and detecting the presence of the 
amplified 20P2H8 cDNA. Optionally, the sequence of the amplified 20P2H8 cDNA can be 
determined. In another embodiment, a method of detecting a 20P2H8 gene in a biological 
sample comprises first isolating genomic DNA from the sample; amplifying the isolated 
genomic DNA using 20P2H8 polynucleotides as sense and antisense primers to amplify the 
20P2H8 gene therein; and detecting the presence of the amplified 20P2H8 gene. Any 
number of appropriate sense and antisense probe combinations may be designed from the 
nucleotide sequences provided for the 20P2H8 (SEQ ID NO: 1) and used for this purpose. 

The invention also provides assays for detecting the presence of a 20P2H8 protein in a 
tissue of other biological sample such as serum, bone, prostate, and other tissues, urine, cell 
preparations, and the like. Methods for detecting a 20P2H8 protein are also well known and 
include, for example, immunoprecipitation, immxinohistochemical analysis, Western Blot 
analysis, molecular binding assays, ELISA, ELIFA and the like. For example, in one 
embodiment, a method of detecting the presence of a 20P2H8 protein in a biological sample 
comprises first contacting the sample with a 20P2H8 antibody, a 20P2H8-reactive fragment 
thereof, or a recombinant protein containing an antigen binding region of a 20P2H8 
antibody; and then detecting the binding of 20P2H8 protein in the sample thereto. 

Methods for identifying a cell that expresses 20P2H8 are also provided. In one 
embodiment, an assay for identifying a cell that expresses a 20P2H8 gene comprises detecting 
the presence of 20P2H8 mRNA in the cell. Methods for the detection of particular mRNAs in 
cells are well known and include, for example, hybridization assays using complementary DNA 
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probes (such as in situ hybridization using labeled 20P2H8 riboprobes, Northern blot and 
related techniques) and various nucleic acid amplification assays (such as RT-PCR using 
complementary primers specific for 20P2H8, and other amplification type detection methods, 
such as, for example, branched DNA, SISBA, TMA and the like). Alternatively, an assay for 
5 identifying a cell that expresses a 20P2H8 gene comprises detecting the presence of 20P2H8 
protein in the cell or secreted by the cell. Various methods for the detection of proteins are 
well known in the art and may be employed for the detection of 20P2H8 proteins and 20P2H8 
expressing cells. 

20P2H8 expression analysis may also be useful as a tool for identifying and evaluating 
10 agents that modulate 20P2H8 gene expression. For example, 20P2H8 expression is 
significantly upregulated in prostate cancer, and may also be expressed in other cancers. 
Identification of a molecule or biological agent that could inhibit 20P2H8 expression or 
over-expression in cancer cells may be of therapeutic value. Such an agent may be identified 
by using a screen that quantifies 20P2H8 expression by RT-PCR, nucleic acid hybridization 
1 5 or antibody binding. 

MONITORING THE STATUS OF 20P2H8 AND ITS PRODUCTS 

Assays that evaluate the status of the 20P2H8 gene and 20P2H8 gene products in an 
individual may provide information on the growth or oncogenic potential of a biological 

20 sample from this individual. For example, because 20P2H8 mRNA is so highly expressed in 
prostate cancers as compared to normal prostate tissue, assays that evaluate the relative levels 
of 20P2H8 mRNA transcripts or proteins in a biological sample may be used to diagnose a 
disease associated with 20P2H8 dysregulation such as cancer and may provide prognostic 
information useful in defining appropriate therapeutic options. 

25 Because 20P2H8 is expressed, for example, in various prostate cancer xenograft tissues 

and cancer cell lines, and cancer patient samples, the expression status of 20P2H8 can provide 
information useful for determining information including the presence, stage and location of 
displasic, precancerous and cancerous cells, predicting susceptibility to various stages of disease, 
and/or for gauging tumor aggressiveness. Moreover, the expression profile makes it a 

30 potential imaging reagent for metastasized disease. Consequently, an important aspect of the 
invention is directed to the various molecular prognostic and diagnostic methods for examining 
the status of 20P2H8 in biological samples such as those from individuals suffering from, or 
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suspected of suffering from a pathology characterized by dysregulated cellular growth such 
as cancer. 

Oncogenesis is known to be a multistep process where cellular growth becomes 
progressively dysregulated and cells progress from a normal physiological state to 

5 precancerous and then cancerous states (see e.g. Alers et aL, Lab Invest 77(5): 437-438 
(1997) and Isaacs et aL, Cancer Surv. 23: 19-32 (1995)). In this context, examining a 
biological sample for evidence of dysregulated cell growth (such as aberrant 20P2H8 
expression in prostate cancers) can allow the early detection of such aberrant cellular 
physiology before a pathology such as cancer has progressed to a stage at which therapeutic 

10 options are more limited. In such examinations, the status of 20P2H8 in a biological sample 
of interest (such as one suspected of having dysregulated cell growth) can be compared, for 
example, to the status of 20P2H8 in a corresponding normal sample (e.g. a sample from that 
individual (or alternatively another individual) that is not effected by a pathology, for 
example one not suspected of having dysregulated cell growth) with alterations in the status 

15 of 20P2H8 in the biological sample of interest (as compared to the normal sample) 
providing evidence of dysregulated cellular growth. In addition to using a biological sample 
that is not effected by a pathology as a normal sample, one can also use a predetermined 
normative value such as a predetermined normal level of mRNA expression (see e.g. Grever 
et aL, J. Comp. Neurol. 1996 Dec 9;376(2):306-14 and U.S. patent No. 5,837,501) to 

20 compare 20P2H8 in normal versus suspect samples. 

The term "status' 1 in this context is used according to its art accepted meaning and 
refers to the condition or state of a gene and its products. As specifically described herein, the 
status of 20P2H8 can be evaluated by a number of parameters known in the art. Typically 
an alteration in the status of 20P2H8 comprises a change in the location of 20P2H8 

25 expressing cells and/or an increase in 20P2H8 mRNA and/ or protein expression. 

Typically, skilled artisans use a number of parameters to evaluate the condition or state 
of a gene and its products. These include, but are not limited to the location of expressed gene 
products (including the location of 20P2H8 expressing cells) as well as the, level, and 
biological activity of expressed gene products (such as 20P2H8 mRNA polynucleotides and 

30 polypeptides). Alterations in the status of 20P2H8 can be evaluated by a wide variety of 
methodologies well known in the art, typically those discussed below. Typically an alteration 
in the status of 20P2H8 comprises a change in the location of 20P2H8 and/or 20P2H8 
expressing cells and/or an increase in 20P2H8 mRNA and/or protein expression. 
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As discussed in detail herein, in order to identify a condition or phenomenon 
associated with dysregulated cell growth, the status of 20P2H8 in a biological sample may be 
evaluated by a number of methods utilized by skilled artisans including, but not limited to 
genomic Southern analysis (to examine, for example perturbations in the 20P2H8 gene), 

5 northerns and/or PCR analysis of 20P2H8 mRNA (to examine, for example alterations in 
the polynucleotide sequences or expression levels of 20P2H8 mRNAs), and western and/ or 
immunohistochemical analysis (to examine, for example alterations in polypeptide 
sequences, alterations in polypeptide localization within a sample, alterations in expression 
levels of 20P2H8 proteins and/or associations of 20P2H8 proteins with polypeptide binding 

10 partners). Detectable 20P2H8 polynucleotides include, for example, a 20P2H8 gene or 
fragments thereof, 20P2H8 mRNA, alternative splice variants 20P2H8 mRNAs, and 
recombinant DNA or RNA molecules containing a 20P2H8 polynucleotide. 

The expression profile of 20P2H8 makes it a potential diagnostic marker for local 
and/or metastasized disease. In particular, the status of 20P2H8 may provide information 

15 useful for predicting susceptibility to particular disease stages, progression, and/or tumor 
aggressiveness. The invention provides methods and assays for determining 20P2H8 status 
and diagnosing cancers that express 20P2H8, such as cancers of the prostate, bladder, testis, 
ovaries, breast, pancreas, colon and lung. 20P2H8 status in patient samples may be analyzed 
by a number of means well known in the art, including without limitation, 

20 immunohistochemical analysis, in situ hybridization, RT-PCR analysis on laser capture micro- 
dissected samples, western blot analysis of clinical samples and cell lines, and tissue array 
analysis. Typical protocols for evaluating the status of the 20P2H8 gene and gene products can 
be found, for example in Ausubul et al. eds., 1995, Current Protocols In Molecular Biology, 
Units 2 [Northern Blotting], 4 [Southern Blotting], 15 [Immunoblotting] and 18 [PCR 

25 Analysis]. 

As described above, the status of 20P2H8 in a biological sample can be examined by 
a number of well known procedures in the art. For example, the status of 20P2H8 in a 
biological sample taken from a specific location in the body can be examined by evaluating 
the sample for the presence or absence of 20P2H8 expressing cells (e.g. those that express 
30 20P2H8 mRNAs or proteins). This examination can provide evidence of dysregulated 
cellular growth for example, when 20P2H8 expressing cells are found in a biological sample 
that does not normally contain such cells (such as a lymph node). Such alterations in the 
status of 20P2H8 in a biological sample are often associated with dysregulated cellular 
growth. Specifically, one indicator of dysregulated cellular growth is the metastases of 
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cancer cells from an organ of origin (such as the testis or prostate gland) to a different area 
of the body (such as a lymph node). In this context, evidence of dysregulated cellular 
growth is important for example because occult lymph node metastases can be detected in a 
substantial proportion of patients with prostate cancer, and such metastases are associated 
5 with known predictors of disease progression (see e.g. J Urol 1995 Aug;154(2 Pt l):474-8). 

In one aspect, the invention provides methods for monitoring 20P2H8 gene 
products by determining the status of 20P2H8 gene products expressed by cells in a test 
tissue sample from an individual suspected of having a disease associated with dysregulated 
cell growth (such as hyperplasia or cancer) and then comparing the status so determined to 

10 the status of 20P2H8 gene products in a corresponding normal sample, the presence of 
aberrant 20P2H8 gene products in the test sample relative to the normal sample providing 
an indication of the presence of dysregulated cell growth within the cells of the individual. 

In another aspect, the invention provides assays useful in determining the presence 
of cancer in an individual, comprising detecting a significant increase in 20P2H8 mRNA or 

15 protein expression in a test cell or tissue sample relative to expression levels in the 
corresponding normal cell or tissue. The presence of 20P2H8 mRNA may, for example, be 
evaluated in tissue samples including but not limited to prostate, kidney, bladder, cervical, 
skin, stomach, testicular, ovarian, breast, pancreas, colon and lung issues (see e.g. FIGS. 4- 
8)* The presence of significant 20P2H8 expression in any of these tissues may be useful to 

20 indicate the emergence, presence and/or severity of these cancers, since the corresponding 
normal tissues do not express 20P2H8 mRNA or express it at lower levels. 

In a related embodiment, 20P2H8 status may be determined at the protein level rather 
than at the nucleic acid level. For example, such a method or assay would comprise 
determining the level of 20P2H8 protein expressed by cells in a test tissue sample and 

25 comparing the level so determined to the level of 20P2H8 expressed in a corresponding normal 
sample. In one embodiment, the presence of 20P2H8 protein is evaluated, for example, 
using immunohistochemical methods. 20P2H8 antibodies or binding partners capable of 
detecting 20P2H8 protein expression may be used in a variety of assay formats well known in 
the art for this purpose. 

30 In other related embodiments, one can evaluate the integrity 20P2H8 nucleotide and 

amino acid sequences in a biological sample in order to identify perturbations in the structure 
of these molecules such as insertions, deletions, substitutions and the like. Such embodiments 
are useful because perturbations in the nucleotide and amino acid sequences are observed in a 
large number of proteins associated with a growth dysregulated phenotype (see, e.g., Marrogi 
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et al., 1999, J. Cutan. Pathol. 26(8):369-378). In this context, a wide variety of assays for 
observing perturbations in nucleotide and amino acid sequences are well known in the art. For 
example, the size and structure of nucleic acid or amino acid sequences of 20P2H8 gene 
products may be observed by the Northern, Southern, Western, PCR and DNA sequencing 
5 protocols discussed herein. In addition, other methods for observing perturbations in 
nucleotide and amino acid sequences such as single strand conformation polymorphism 
analysis are well known in the art (see, e.g., U.S. Patent Nos. 5,382,510 and 5,952,170). 

In another embodiment, one can examine the methylation status of the 20P2H8 gene 
in a biological sample. Aberrant demethylation and/or hypermethyktion of CpG islands in 

10 gene 5' regulatory regions frequendy occurs in immortalized and transformed cells and can 
result in altered expression of various genes. For example, promoter hypermethylation of the 
pi-class glutathione S-transferase (a protein expressed in normal prostate but not expressed 
in >90% of prostate carcinomas) appears to permanendy silence transcription of this gene 
and is the most frequendy detected genomic alteration in prostate carcinomas (De Marzo et 

15 aL, Am. J. Pathol. 155(6): 19854992 (1999)). In addition, this alteration is present in at least 
70% of cases of high-grade prostatic intraepithelial neoplasia (PIN) (Brooks et al, Cancer 
Epidemiol. Biomarkers Prev., 1998, 7:531-536). In another example, expression of the 
LAGE-I tumor specific gene (which is not expressed in normal prostate but is expressed in 
25-50% of prostate cancers) is induced by deoxy-azacytidine in lymphoblastoid cells, 

20 suggesting that tumoral expression is due to demethylation (Lethe et al., Int. J. Cancer 76(6): 
903-908 (1998)). In this context, a variety of assays for examining methylation status of a gene 
are well known in the art. For example, one can utilize in Southern hybridization approaches 
methylation-sensitive restriction enzymes which can not cleave sequences that contain 
methylated CpG sites in order to assess the overall methylation status of CpG islands. In 

25 addition, MSP (methylation specific PCR) can rapidly profile the methylation status of all the 
CpG sites present in a CpG island of a given gene. This procedure involves initial modification 
of DNA by sodium bisulfite (which will convert all unmethylated cytosines to uracil) followed 
by amplification using primers specific for methylated versus unmethylated DNA. Protocols 
involving methylation interference can also be found for example in Current Protocols In 

30 Molecular Biology, Units 12, Frederick M. Ausubul et al. eds., 1995. 

Gene amplification provides an additional method of assessing the status of 
20P2H8, a locus that maps to 15q22.32-23, a region shown to be perturbed in a variety of 
cancers. Gene amplification may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA 
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(Thomas, 1980, Proc. Natl. Acad. Sci. USA, 77:5201-5205), dot blotting (DNA analysis), or 
in situ hybridization, using an appropriately labeled probe, based on the sequences provided 
herein. Alternatively, antibodies may be employed that can recognize specific duplexes, 
including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein 
duplexes. The antibodies in turn may be labeled and the assay may be carried out where the 
duplex is bound to a surface, so that upon the formation of duplex on the surface, the 
presence of antibody bound to the duplex can be detected. 

In addition to the tissues discussed above, biopsied tissue or peripheral blood may be 
conveniently assayed for the presence of cancer cells, including but not limited to prostate, 
kidney, bladder, cervical, skin, stomach, testicular, ovarian, breast, pancreas, colon and lung 
cancers using for example, Northern, dot blot or RT-PCR analysis to detect 20P2H8 
expression (see e.g. FIGS 4-8). The presence of RT-PCR amplifiable 20P2H8 mRNA provides 
an indication of the presence of the cancer. RT-PCR detection assays for tumor cells in 
peripheral blood are currently being evaluated for use in the diagnosis and management of a 
number of human solid tumors. In the prostate cancer field, these include RT-PCR assays for 
the detection of cells expressing PSA and PSM (Verkaik et aL, 1997, Urol. Res. 25:373-384; 
Ghossein et aL, 1995, J. Clin. Oncol. 13:1195-2000; Heston et aL, 1995, Clin. Chem. 41:1687- 
1 688), RT-PCR assays are well known in the art. 

A related aspect of the invention is directed to predicting susceptibility to developing 
cancer in an individual. In one embodiment, a method for predicting susceptibility to cancer 
comprises detecting 20P2H8 mRNA or 20P2H8 protein in a tissue sample, its presence 
indicating susceptibility to cancer, wherein the degree of 20P2H8 mRNA expression present is 
proportional to the degree of susceptibility. In a specific embodiment, the presence of 20P2H8 
in prostate tissue is examined, with the presence of 20P2H8 in the sample providing an 
indication of prostate cancer susceptibility (or the emergence or existence of a prostate tumor). 
In another specific embodiment, the presence of 20P2H8 in tissue is examined, with the 
presence of 20P2H8 in the sample providing an indication of cancer susceptibility (or the 
emergence or existence of a tumor). In a closely related embodiment, one can evaluate the 
integrity 20P2H8 nucleotide and amino acid sequences in a biological sample in order to 
identify perturbations in the structure of these molecules such as insertions, deletions, 
substitutions and the like, with the presence of one or more perturbations in 20P2H8 gene 
products in the sample providing an indication of cancer susceptibility (or the emergence or 
existence of a tumor). 
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Yet another related aspect of the invention is directed to methods for gauging tumor 
aggressiveness. In one embodiment, a method for gauging aggressiveness of a tumor 
comprises determining the level of 20P2H8 mRNA or 20P2H8 protein expressed by cells in a 
sample of the tumor, comparing the level so determined to the level of 20P2H8 mRNA or 
5 20P2H8 protein expressed in a corresponding normal tissue taken from the same individual or 
a normal tissue reference sample, wherein the degree of 20P2H8 mRNA or 20P2H8 protein 
expression in the tumor sample relative to the normal sample indicates the degree of 
aggressiveness. In a specific embodiment, aggressiveness of a tumor is evaluated by 
determining the extent to which 20P2H8 is expressed in the tumor cells, with higher expression 
10 levels indicating more aggressive tumors. In a closely related embodiment, one can evaluate the 
integrity of 20P2H8 nucleotide and amino acid sequences in a biological sample in order to 
identify perturbations in the structure of these molecules such as insertions, deletions, 
substitutions and the like, with the presence of one or more perturbations indicating more 
aggressive tumors. 

15 Yet another related aspect of the invention is directed to methods for observing the 

progression of a malignancy in an individual over time. In one embodiment, methods for 
observing the progression of a malignancy in an individual over time comprise determining the 
level of 20P2H8 mRNA or 20P2H8 protein expressed by cells in a sample of the tumor, 
comparing the level so determined to the level of 20P2H8 mRNA or 20P2H8 protein 

20 expressed in an equivalent tissue sample taken from the same individual at a different time, 
wherein the degree of 20P2H8 mRNA or 20P2H8 protein expression in the tumor sample 
over time provides information on the progression of the cancer. In a specific embodiment, 
the progression of a cancer is evaluated by determining the extent to which 20P2H8 expression 
in the tumor cells alters over time, with higher expression levels indicating a progression of the 

25 cancer. In a closely related embodiment, one can evaluate the integrity 20P2H8 nucleotide and 
amino acid sequences in a biological sample in order to identify perturbations in the structure 
of these molecules such as insertions, deletions, substitutions and the like, with the presence of 
one or more perturbations indicating a progression of the cancer. 

The above diagnostic approaches may be combined with any one of a wide variety of 

30 prognostic and diagnostic protocols known in the art. For example, another embodiment of 
the invention disclosed herein is directed to methods for observing a coincidence between the 
expression of 20P2H8 gene and 20P2H8 gene products (or perturbations in 20P2H8 gene and 
20P2H8 gene products) and a factor that is associated with malignancy as a means of 
diagnosing and prognosticating the status of a tissue sample. In this context, a wide variety of 
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factors associated with malignancy may be utilized such as the expression of genes otherwise 
associated with maHgnancy (including PSA, PSCA and PSM expression) as weU as gross 
cytolo^cal observations (see e.g. Bocking et aL, 1984, Anal Quant. CytoL 6(2):74-88; Eptsein, 
1995, Hum. Pathol. 26(2):223-9; Thorson et aL, 1998, Mod. Pathol. 11(6):543-51; Baisden et 

5 aL, 1999, Am. J. Surg. Pathol. 23 (8) :9 18-24). Methods for observing a coincidence between 
the expression of 20P2H8 gene and 20P2H8 gene products (or perturbations in 20P2H8 gene 
and 20P2H8 gene products) and an additional factor that is associated with malignancy are 
useful, for example, because the presence of a set or constellation of specific factors that 
coincide provides information crucial for diagnosing and prognosticating the status of a tissue 

10 sample. 

In a typical embodiment, methods for observing a coincidence between the expression 
of 20P2H8 gene and 20P2H8 gene products (or perturbations in 20P2H8 gene and 20P2H8 
gene products) and a factor that is associated with malignancy entails detecting the 
overexpression of 20P2H8 mRNA or protein in a tissue sample, detecting the overexpression 

15 of PSA mRNA or protein in a tissue sample, and observing a coincidence of 20P2H8 mRNA 
or protein and PSA mRNA or protein overexpression. In a specific embodiment, the 
expression of 20P2H8 and PSA mRNA in prostate tissue is examined. In a preferred 
embodiment, the coincidence of 20P2H8 and PSA mRNA overexpression in the sample 
provides an indication of prostate cancer, prostate cancer susceptibility or the emergence or 

20 existence of a prostate tumor. 

Methods for detecting and quantifying the expression of 20P2H8 mRNA or protein 
are described herein and use of standard nucleic acid and protein detection and quantification 
technologies is well known in the art. Standard methods for the detection and quantification of 
20P2H8 mRNA include in situ hybridization using labeled 20P2H8 riboprobes, Northern blot 

25 and related techniques using 20P2H8 polynucleotide probes, RT-PCR analysis using primers 
specific for 20P2H8, and other amplification type detection methods, such as, for example, 
branched DNA, SISBA, TMA and the like. In a specific embodiment, semi-quantitative RT- 
PCR may be used to detect and quantify 20P2H8 mRNA expression as described in the 
Examples that follow. Any number of primers capable of amplifying 20P2H8 may be used for 

30 this purpose, including but not limited to the various primer sets specifically described herein. 
Standard methods for the detection and quantification of protein may be used for this purpose. 
In a specific embodiment, polyclonal or monoclonal antibodies specifically reactive with the 
wild-type 20P2H8 protein may be used in an immunohistochemical assay of biopsied tissue. 
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IDENTIFYING MOLECULES THAT INT ERACT WITH 20P2H8 

The 20P2H8 protein sequences disclosed herein allow the skilled artisan to identify 
proteins, small molecules and other agents that interact with 20P2H8 and pathways 
activated by 20P2H8 via any one of a variety of art accepted protocols. For example one 
can utilise, one of the variety of so-called interaction trap systems (also referred to as the 
"two-hybrid assay"). In such systems, molecules that interact reconstitute a transcription 
factor and direct expression of a reporter gene, the expression of which is then assayed. 
Typical systems identify protein-protein interactions in vivo through reconstitution of a 
eukaryotic transcriptional activator and are disclosed for example in U.S. Patent Nos. 
5,955,280, 5,925,523, 5,846,722 and 6,004,746. 

Alternatively one can identify molecules that interact with 20P2H8 protein sequences 
by screening peptide libraries. In such methods, peptides that bind to selected receptor 
molecules such as 20P2H8 are identified by screening libraries that encode a random or 
controlled collection of amino acids. Peptides encoded by the libraries are expressed as 
ftision proteins of bacteriophage coat proteins, and bacteriophage particles are then 
screened against the receptors of interest. 

Peptides having a wide variety of uses, such as therapeutic or diagnostic reagents, 
may thus be identified without any prior information on the structure of the expected ligand 
or receptor molecule. Typical peptide libraries and screening methods that can be used to 
identify molecules that interact with 20P2H8 protein sequences are disclosed for example in 
U.S. Patent Nos. 5,723,286 and 5,733,731. 

Alternatively, cell lines expressing 20P2H8 can be used to identify protein-protein 
interactions mediated by 20P2H8. This possibility can be examined using 
immunoprecipitation techniques as shown by others (Hamilton BJ, et al. Biochem. Biophys. 
Res. Commun. 1999, 261:646-51). Typically 20P2H8 protein can be immunoprecipitated 
from 20P2H8 expressing prostate cancer cell lines using anti-20P2H8 antibodies. 
Alternatively, antibodies against His-tag can be used in a cell line engineered to express 
20P2H8 (vectors mentioned above). The immunoprecipitated complex can be examined 
for protein association by procedures such as western blotting, 35 S-methionine labeling of 
proteins, protein microsequencing, silver staining and two dimensional gel electrophoresis. 

Small molecules that interact with 20P2H8 can be identified through related 
embodiments of such screening assays. For example, small molecules can be identified that 
interfere with protein function, including molecules that interfere with 20P2H8's ability to 
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mediate phosphorylation and de-phosphorylation, second messenger signaling and 
tumorigenesis. Typical methods are discussed for example in U.S. Patent No. 5,928,868 and 
include methods for forming hybrid ligands in which at least one ligand is a small molecule. 
In an illustrative embodiment, the hybrid ligand is introduced into cells that in turn contain 

5 a first and a second expression vector. Each expression vector includes DNA for 
expressing a hybrid protein that encodes a target protein linked to a coding sequence for a 
transcriptional module. The cells further contains a reporter gene, the expression of which 
is conditioned on the proximity of the first and second hybrid proteins to each other, an 
event that occurs only if the hybrid ligand binds to target sites on both hybrid proteins. 

10 Those cells that express the reporter gene are selected and the unknown small molecule or 
the unknown hybrid protein is identified. 

A typical embodiment of this invention consists of a method of screening for a 
molecule that interacts with a 20P2H8 amino acid sequence shown in FIG. 1 (SEQ ID NO: 
2), comprising the steps of contacting a population of molecules with the 20P2H8 amino 

15 acid sequence, allowing the population of molecules and the 20P2H8 amino acid sequence 
to interact under conditions that facilitate an interaction, determining the presence of a 
molecule that interacts with the 20P2H8 amino acid sequence and then separating molecules 
that do not interact with the 20P2H8 amino acid sequence from molecules that do interact 
with the 20P2H8 amino acid sequence. In a specific embodiment, the method further 

20 includes purifying a molecule that interacts with the 20P2H8 amino acid sequence. In a 
preferred embodiment, the 20P2H8 amino acid sequence is contacted with a library of 
peptides. 

THERAPEUTIC METHODS AND COMPOSITIONS 

25 The identification of 20P2H8 as a gene that is highly expressed in cancers of the 

prostate (and possibly other cancers), opens a number of therapeutic approaches to the 
treatment of such cancers. As discussed above, it is possible that 20P2H8 is secreted from 
cancer cells and in this way modulates proliferation signals. Its potential role as a 
transcription factor and its high expression in prostate cancer makes it a potential target for 

30 small molecule-mediated therapy. 

Accordingly, therapeutic approaches aimed at inhibiting the activity of the 20P2H8 
protein are expected to be useful for patients suffering from prostate cancer, testicular 
cancer, and other cancers expressing 20P2H8. These therapeutic approaches aimed at 
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inhibiting the activity of the 20P2H8 protein generally fall into two classes. One class 
comprises various methods for inhibiting the binding or association of the 20P2H8 protein 
with its binding partner or with other proteins. Another class comprises a variety of 
methods for inhibiting the transcription of the 20P2H8 gene or translation of 20P2H8 
5 mRNA. 



20P2H8 as a Target for Antibody-Based Therapy 

10 The structural features of 20P2H8 indicate that this molecule is an attractive target 

for antibody-based therapeutic strategies. A number of typical antibody strategies are 
known in the art for targeting both extracellular and intracellular molecules (see e.g. 
complement and ADCC mediated killing as well as the use of intrabodies discussed below). 
Because 20P2H8 is expressed by cancer cells of various lineages and not by corresponding 

15 normal cells, systemic administration of 20P2H8-immunoreactive compositions would be 
expected to exhibit excellent sensitivity without toxic, non-specific and/ or non-target effects 
caused by binding of the immunotherapeutic molecule to non-target organs and tissues. 
Antibodies specifically reactive with domains of 20P2H8 can be useful to treat 20P2H8- 
expressing cancers systemically, either as conjugates with a toxin or therapeutic agent, or as 

20 naked antibodies capable of inhibiting cell proliferation or function. 

20P2H8 antibodies can be introduced into a patient such that the antibody binds to 
20P2H8 and modulates or perturbs a function such as an interaction with a binding partner 
and consequently mediates the growth inhibition and/ or destruction of the cells and the 
tumor and/or inhibits the growth of the cells or the tumor. Mechanisms by which such 

25 antibodies exert a therapeutic effect may include complement-mediated cytolysis, antibody- 
dependent cellular cytotoxicity, modulating the physiological function of 20P2H8, inhibiting 
ligand binding or signal transduction pathways, modulating tumor cell differentiation, 
altering tumor angiogenesis factor profiles, and/ or by inducing apoptosis. 20P2H8 
antibodies can be conjugated to toxic or therapeutic agents and used to deliver the toxic or 

30 therapeutic agent directly to 20P2H8-bearing tumor cells. Examples of toxic agents include, 
but are not limited to, calchemicin, maytansinoids, radioisotopes such as 131 1, ytrium, and 
bismuth. 

Cancer immunotherapy using anti-20P2H8 antibodies may follow the teachings 
generated from various approaches that have been successfully employed in the treatment 
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of other types of cancer, including but not limited to colon cancer (Aden et al., 1998, Grit. 
Rev. Immunol. 18:133438), multiple myeloma (Ozaki et al, 1997, Blood 90:3179-3186; 
Tsunenari et al., 1997, Blood 90:2437-2444), gastric cancer (Kasprzyk et al., 1992, Cancer 
Res. 52:2771-2776), B-cell lymphoma (Funakoshi et al., 1996, J. Immunother. Emphasis 
5 Tumor Immunol. 19:93-101), leukemia (Zhong et al., 1996, Leuk. Res. 20:581-589), 
colorectal cancer (Moun et al., 1994, Cancer Res. 54:6160-6166; Velders et al., 1995, Cancer 
Res. 55:4398-4403), and breast cancer (Shepard et al., 1991, J. Clin. Immunol. 11:117-127). 
Some therapeutic approaches involve conjugation of naked antibody to a toxin, such as the 
conjugation of 131 I to anti-CD20 antibodies (e.g., Rituxan™, ID EC Pharmaceuticals Corp.), 

10 while others involve co-administration of antibodies and other therapeutic agents, such as 
Herceptin™ (trastuzumab) with paclitaxel (Genentech, Inc.). For treatment of prostate 
cancer, for example, 20P2H8 antibodies can be administered in conjunction with radiation, 
chemotherapy or hormone ablation. 

Although 20P2H8 antibody therapy may be useful for all stages of cancer, antibody 

15 therapy may be particularly appropriate in advanced or metastatic cancers. Treatment with 
the antibody therapy of the invention may be indicated for patients who have received 
previously one or more chemotherapy, while combining the antibody therapy of the 
invention with a chemotherapeutic or radiation regimen may be preferred for patients who 
have not received chemotherapeutic treatment. Additionally, antibody therapy may enable 

20 the use of reduced dosages of concomitant chemotherapy, particularly for patients who do 
not tolerate the toxicity of the chemotherapeutic agent very well. 

It may be desirable for some cancer patients to be evaluated for the presence and 
level of 20P2H8 expression, preferably using immunohistochemical assessments of tumor 
tissue, quantitative 20P2H8 imaging, or other techniques capable of reliably indicating the 

25 presence and degree of 20P2H8 expression. Immunohistochemical analysis of tumor 
biopsies or surgical specimens may be preferred for this purpose. Methods for 
immunohistochemical analysis of tumor tissues are well known in the art 

Anti-20P2H8 monoclonal antibodies useful in treating prostate and other cancers 
include those that are capable of initiating a potent immune response against the tumor and 

30 those that are capable of direct cytotoxicity. In this regard, anti-20P2H8 monoclonal 
antibodies (mAbs) may elicit tumor cell lysis by either complement-mediated or antibody- 
dependent cell cytotoxicity (ADCC) mechanisms, both of which require an intact Fc portion 
of the immunoglobulin molecule for interaction with effector cell Fc receptor sites or 
complement proteins. In addition, anti-20P2H8 mAbs that exert a direct biological effect 
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on tumor growth are useful in the practice of the invention. Potential mechanisms by which 
such directly cytotoxic mAbs may act include inhibition of cell growth, modulation of 
cellular differentiation, modulation of tumor angiogenesis factor profiles, and the induction 
of apoptosis. The mechanism by which a particular anti-20P2H8 mAb exerts an anti-tumor 
5 effect may be evaluated using any number of in vitro assays designed to determine ADCC, 
ADMMC, complement-mediated cell lysis, and so forth, as is generally known in the art. 

The use of murine or other non-human monoclonal antibodies, or human/mouse 
chimeric mAbs may induce moderate to strong immune responses in some patients. In 
some cases, this will result in clearance of the antibody from circulation and reduced 

10 efficacy. In the most severe cases, such an immune response may lead to the extensive 
formation of immune complexes which, potentially, can cause renal failure. Accordingly, 
preferred monoclonal antibodies used in the practice of the therapeutic methods of the 
invention are those that are either fully human or humanized and that bind specifically to 
the target 20P2H8 antigen with high affinity but exhibit low or no antigenicity in the patient. 

15 Therapeutic methods of the invention contemplate the administration of single anti- 

20P2H8 mAbs as well as combinations, or cocktails, of different mAbs. Such mAb cocktails 
may have certain advantages inasmuch as they contain mAbs that target different epitopes, 
exploit different effector mechanisms or combine directly cytotoxic mAbs with mAbs that 
rely on immune effector functionality. Such mAbs in combination may exhibit synergistic 

20 therapeutic effects. In addition, the administration of anti-20P2H8 mAbs may be combined 
with other therapeutic agents, including but not limited to various chemotherapeutic agents, 
androgen-blockers, and immune modulators (e.g., IL-2, GM-CSF). The anti-20P2H8 mAbs 
may be administered in their "naked" or unconjugated form, or may have therapeutic agents 
conjugated to them. 

25 The anti-20P2H8 antibody formulations may be administered via any route capable 

of delivering the antibodies to the tumor site. Potentially effective routes of administration 
include, but are not limited to, intravenous, intraperitoneal, intramuscular, intratumor, 
intradermal, and the like. Treatment will generally involve the repeated administration of 
the anti-20P2H8 antibody preparation via an acceptable route of administration such as 

30 intravenous injection (IV), typically at a dose in the range of about 0.1 to about 10 mg/kg 
body weight Doses in the range of 10-500 mg mAb per week may be effective and well 
tolerated. 

Based on clinical experience with the Herceptin mAb in the treatment of metastatic 
breast cancer, an initial loading dose of approximately 4 mg/kg patient body weight IV 
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followed by weekly doses of about 2 mg/kg IV of the anti- 20P2H8 mAb preparation may 
represent an acceptable dosing regimen. Preferably, the initial loading dose is administered 
as a 90 minute or longer infusion. The periodic maintenance dose may be administered as a 
30 minute or longer infusion, provided the initial dose was well tolerated. However, as one 
5 of skill in the art will understand, various factors will influence the ideal dose regimen in a 
particular case. Such factors may include, for example, the binding affinity and half life of 
the Ab or mAbs used, the degree of 20P2H8 expression in the patient, the extent of 
circulating shed 20P2H8 antigen, the desired steady-state antibody concentration level, 
frequency of treatment, and the influence of chemotherapeutic agents used in combination 

10 with the treatment method of the invention. 

Optimally, patients should be evaluated for the level of circulating shed 20P2H8 
antigen in serum in order to assist in the determination of the most effective dosing regimen 
and related factors. Such evaluations may also be used for monitoring purposes throughout 
therapy, and may be useful to gauge therapeutic success in combination with evaluating 

15 other parameters (such as serum PSA levels in prostate cancer therapy). 

Inhibition of 20P2H8 Protein Function 

The invention includes various methods and compositions for inhibiting the binding 
of 20P2H8 to its binding partner or ligand, or its association with other protein(s) as well as 
20 methods for inhibiting 20P2H8 function. 

Inhibition of 20P2H8 With Intracellular Antibodies 

In one approach, recombinant vectors encoding single chain antibodies that 
specifically bind to 20P2H8 may be introduced into 20P2H8 expressing cells via gene 

25 transfer technologies, wherein the encoded single chain anti-20P2H8 antibody is expressed 
intracellularly, binds to 20P2H8 protein, and thereby inhibits its function. Methods for 
engineering such intracellular single chain antibodies are well known. Such intracellular 
antibodies, also known as "intrabodies", may be specifically targeted to a particular 
compartment within the cell, providing control over where the inhibitory activity of the 

30 treatment will be focused. This technology has been successfully applied in the art (for 
review, see Richardson and Marasco, 1995, TIBTECH vol. 13). Intrabodies have been 
shown to virtually eliminate the expression of otherwise abundant cell surface receptors. 
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See, for example, Richardson et al, 1995, Proc, Nad Acad. Sci. USA 92: 3137-3141; Beerli 
et al., 1994, J. Biol. Chem. 289: 23931-23936; Deshane et al., 1994, Gene Ther. 1: 332-337. 

Single chain antibodies comprise the variable domains of the heavy and light chain 
joined by a flexible linker polypeptide, and are expressed as a single polypeptide. Optionally, 
5 single chain antibodies may be expressed as a single chain variable region fragment joined to 
the light chain constant region. Well known intracellular trafficking signals may be 
engineered into recombinant polynucleotide vectors encoding such single chain antibodies 
in order to precisely target the expressed intrabody to the desired intracellular compartment. 
For example, intrabodies targeted to the endoplasmic reticulum (ER) may be engineered to 

10 incorporate a leader peptide and, optionally, a C-terminal ER retention signal, such as the 
KDEL amino acid motif. Intrabodies intended to exert activity in the nucleus may be 
engineered to include a nuclear localization signal. Lipid moieties may be joined to 
intrabodies in order to tether the intrabody to the cytosolic side of the plasma membrane. 
Intrabodies may also be targeted to exert function in the cytosol. For example, cytosolic 

15 intrabodies may be used to sequester factors within the cytosol, thereby preventing them 
from being transported to their natural cellular destination. 

In one embodiment, intrabodies may be used to capture 20P2H8 in the nucleus, 
thereby preventing its activity within the nucleus. Nuclear targeting signals may be 
engineered into such 20P2H8 intrabodies in order to achieve the desired targeting. Such 

20 20P2H8 intrabodies may be designed to bind specifically to a particular 20P2H8 domain. 
In another embodiment, cytosolic intrabodies that specifically bind to the 20P2H8 protein 
may be used to prevent 20P2H8 from gaining access to the nucleus, thereby preventing it 
from exerting any biological activity within the nucleus (e.g., preventing 20P2H8 from 
forming transcription complexes with other factors). 

25 In order to specifically direct the expression of such intrabodies to particular tumor 

cells, the transcription of the intrabody may be placed under the regulatory control of an 
appropriate tumor-specific promoter and/or enhancer. In order to target intrabody 
expression specifically to prostate, for example, the PSA promoter and/or 
promoter/ enhancer may be utilized (See, for example, U.S. Patent No. 5,919,652). 

30 

Inhibition of 20P2H8 With Recombinant Proteins 

In another approach^ recombinant molecules that are capable of binding to 20P2H8 
thereby preventing 20P2H8 from accessing/binding to its binding partner(s) or associating 
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with other protein(s) are used to inhibit 20P2H8 function. Such recombinant molecules 
may, for example, contain the reactive part(s) of a 20P2H8 specific antibody molecule. In a 
particular embodiment, the 20P2H8 binding domain of a 20P2H8 binding partner may be 
engineered into a dimeric fusion protein comprising two 20P2H8 ligand binding domains 
5 linked to the Fc portion of a human IgG, such as human IgGl. Such IgG portion may 
contain, for example, the C H 2 and C H 3 domains and the hinge region, but not the C H 1 domain. 
Such dimeric fusion proteins may be administered in soluble form to patients suffering from a 
cancer associated with the expression of 20P2H8, including but not limited to prostate, 
bladder, testicular, ovarian, breast, pancreas, colon and lung cancers, where the dimeric 
10 fusion protein specifically binds to 20P2H8 thereby blocking 20P2H8 interaction with a 
binding partner. Such dimeric fusion proteins may be further combined into multimeric 
proteins using known antibody linking technologies. 

Inhibition of 20P2H8 Transcription or Translation 

15 Within another class of therapeutic approaches, the invention provides various 

methods and compositions for inhibiting the transcription of the 20P2H8 gene. Similarly, 
the invention also provides methods and compositions for inhibiting the translation of 
20P2H8 mRNA into protein. 

In one approach, a method of inhibiting the transcription of the 20P2H8 gene 

20 comprises contacting the 20P2H8 gene with a 20P2H8 antisense polynucleotide. In another 
approach, a method of inhibiting 20P2H8 mRNA translation comprises contacting the 
20P2H8 mRNA with an antisense polynucleotide. In another approach, a 20P2H8 specific 
ribozyme may be used to cleave the 20P2H8 message, thereby inhibiting translation. Such 
antisense and ribozyme based methods may also be directed to the regulatory regions of the 

25 20P2H8 gene, such as the 20P2H8 promoter and/or enhancer elements. Similarly, proteins 
capable of inhibiting a 20P2H8 gene transcription factor may be used to inhibit 20P2H8 
mRNA transcription. The various polynucleotides and compositions useful in the 
aforementioned methods have been described above. The use of antisense and ribozyme 
molecules to inhibit transcription and translation is well known in the art. 

30 Other factors that inhibit the transcription of 20P2H8 through interfering with 

20P2H8 transcriptional activation may also be useful for the treatment of cancers expressing 
20P2H8. Similarly, factors that are capable of interfering with 20P2H8 processing may be 
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useful for the treatment of cancers expressing 20P2H8. Cancer treatment methods utilizing 
such factors are also within the scope of the invention. 

General Considerations for Therapeutic Strategies 

5 Gene transfer and gene therapy technologies may be used for delivering therapeutic 

polynucleotide molecules to tumor cells synthesizing 20P2H8 (ie., antisense, ribozyme, 
polynucleotides encoding intrabodies and other 20P2H8 inhibitory molecules). A number of 
gene therapy approaches are known in the art. Recombinant vectors encoding 20P2H8 
antisense polynucleotides, ribozymes, factors capable of interfering with 20P2H8 transcription, 

10 and so forth, may be delivered to target tumor cells using such gene therapy approaches. 

The above therapeutic approaches may be combined with any one of a wide variety of 
chemotherapy or radiation therapy regimens. These therapeutic approaches may also enable 
the use of reduced dosages of chemotherapy and/or less frequent administration, particularly in 
patients that do not tolerate the toxicity of the chemotherapeutic agent well. 

15 The anti-tumor activity of a particular composition (e.g., antisense, ribozyme, 

intrabody), or a combination of such compositions, may be evaluated using various in vitro and 
in vivo assay systems. In vitro assays for evaluating therapeutic potential include cell growth 
assays, soft agar assays and other assays indicative of tumor promoting activity, binding assays 
capable of determining the extent to which a therapeutic composition will inhibit the binding 

20 of 20P2H8 to a binding partner, etc. 

In vivo, the effect of a 20P2H8 therapeutic composition may be evaluated in a suitable 
animal model. For example, xenogenic prostate cancer models wherein human prostate cancer 
explants or passaged xenograft tissues are introduced into immune compromised animals, such 
as nude or SCID mice, are appropriate in relation to prostate cancer and have been described 

25 (Klein et al., 1997, Nature Medicine 3:402-408). For example, PCT Patent Application 
W098/16628, Sawyers et al., published April 23, 1998, describes various xenograft models of 
human prostate cancer capable of recapitulating the development of primary tumors, 
micrometastasis, and the formation of osteoblastic metastases characteristic of late stage 
disease. Efficacy may be predicted using assays that measure inhibition of tumor formation, 

30 tumor regression or metastasis, and the like. See, also, the Examples below. 

In vivo assays that qualify the promotion of apoptosis may also be useful in 
evaluating potential therapeutic compositions. In one embodiment, xenografts from 
bearing mice treated with the therapeutic composition may be examined for the presence of 
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apoptotic foci and compared to untreated control xenograft-bearing mice. The extent to 
which apoptotic foci are found in the tumors of the treated mice provides an indication of 
the therapeutic efficacy of the composition. 

The therapeutic compositions used in the practice of the foregoing methods may be 
formulated into pharmaceutical compositions comprising a carrier suitable for the desired 
delivery method. Suitable carriers include any material that when combined with the 
therapeutic composition retains the anti-tumor function of the therapeutic composition and 
is non-reactive with the patient's immune system. Examples include, but are not limited to, 
any of a number of standard pharmaceutical carriers such as sterile phosphate buffered 
saline solutions, bacteriostatic water, and the like (see, generally, Remington's 
Pharmaceutical Sciences 16 th Ed., A. Osal., Ed., 1980). 

Therapeutic formulations may be solubilized and administered via any route capable 
of delivering the therapeutic composition to the tumor site. Potentially effective routes of 
administration include, but are not limited to, intravenous, parenteral, intraperitoneal, 
intramuscular, intratumor, intradermal, inttaorgan, orthotopic, and the like. A preferred 
formulation for intravenous injection comprises the therapeutic composition in a solution of 
preserved bacteriostatic water, sterile unpreserved water, and/or diluted in polyvinylchloride 
or polyethylene bags containing 0.9% sterile Sodium Chloride for Injection, USP. 
Therapeutic protein preparations may be lyophilized and stored as sterile powders, 
preferably under vacuum, and then reconstituted in bacteriostatic water containing, for 
example, benzyl alcohol preservative, or in sterile water prior to injection. 

Dosages and administration protocols for the treatment of cancers using the foregoing 
methods will vary with the method and the target cancer and will generally depend on a 
number of other factors appreciated in the art, 

CANCER VACCINES 

As noted above, the expression profile of 20P2H8 shows that it is highly expressed 
in advanced and metastasized prostate cancer. This expression pattern is reminiscent of the 
Cancer-Testis (CT) antigens or MAGEs, which are testis-specific genes that are up-regulated 
in melanomas and other cancers (Van den Eynde and Boon, Int J Clin Lab Res. 27:81-86, 
1997). Due to their tissue-specific expression and high expression levels in cancer, the 
MAGEs are currently being investigated as targets for cancer vaccines (Durrant, Anticancer 
Drugs 8:727-733, 1997; Reynolds et al., Int J Cancer 72:972-976, 1997). 
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The invention further provides cancer vaccines comprising a 20P2H8 protein or 
fragment thereof, as well as DNA based vaccines. In view of the expression of 20P2H8 cancer 
vaccines are expected to be effective at specifically preventing and/or treating 20P2H8 
expressing cancers without creating non-specific effects on non-target tissues. The use of a 
5 tumor antigen in a vaccine for generating humoral and cell-mediated immunity for use in anti- 
cancer therapy is well known in the art and has been employed in prostate cancer using human 
PSMA and rodent PAP immunogens (Hodge et aL, 1995, Int. J. Cancer 63:231-237; Fong et aL, 
1997, J. Immunol. 159:3113-3117). Such methods can be readily practiced by employing a 
20P2H8 protein, or fragment thereof, or a 20P2H8-encoding nucleic acid molecule and 

10 recombinant vectors capable of expressing and appropriately presenting the 20P2H8 
immunogen. An illustrative example of a typical technique consists of a method of generating 
an immune response (e.g. a humoral response) in a mammal comprising the steps exposing the 
mammal's immune system to an immunoreactive epitope (e.g. an epitope of the 20P2H8 
protein shown in SEQ ID NO: 2) so that the mammal generates an immune response that is 

15 specific for that epitope is generated (e.g. antibodies that specifically recognize that epitope). 

For example, viral gene delivery systems may be used to deliver a 20P2H8-encoding 
nucleic acid molecule. Various viral gene delivery systems that can be used in the practice of 
this aspect of the invention include, but are not limited to, vaccinia, fowlpox, canarypox, 
adenovirus, influenza, poliovirus, adeno-associated virus, lentivirus, and sindbus virus (Restifo, 

20 1996, Curr. Opin. Immunol. 8:658-663). Non-viral delivery systems may also be employed by 
using naked DNA encoding a 20P2H8 protein or fragment thereof introduced into the patient 
(e.g., intramuscularly) to induce an anti-tumor response. In one embodiment, the full-length 
human 20P2H8 cDNA may be employed. In another embodiment, 20P2H8 nucleic acid 
molecules encoding specific cytotoxic T lymphocyte (CTL) epitopes may be employed. CTL 

25 epitopes can be determined using specific algorithms (e.g., Epimer, Brown University) to 
identify peptides within a 20P2H8 protein that are capable of optimally binding to specified 
HLA alleles. 

Various ex vivo strategies may also be employed. One approach involves the use of 
dendritic cells to present 20P2H8 antigen to a patient's immune system. Dendritic cells express 
30 MHC class I and II, B7 co-stimulator, and IL-12, and are thus highly specialized antigen 
presenting cells. In prostate cancer, autologous dendritic cells pulsed with peptides of the 
prostate-specific membrane antigen (PSMA) are being used in a Phase I clinical trial to 
stimulate prostate cancer patients' immune systems (Tjoa et aL, 1996, Prostate 28:65-69; 
Murphy et aL, 1996, Prostate 29:371-380). Dendritic cells can be used to present 20P2H8 
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peptides to T cells in the context of MHC class I and II molecules. In one embodiment, 
autologous dendritic cells are pulsed with 20P2H8 peptides capable of binding to MHC 
molecules. In another embodiment, dendritic cells are pulsed with the complete 20P2H8 
protein. Yet another embodiment involves engineering the overexpression of the 20P2H8 
5 gene in dendritic cells using various implementing vectors known in the art, such as 
adenovirus (Arthur et al., 1997, Cancer Gene Ther. 4:17-25), retrovirus (Henderson et al., 
1996, Cancer Res. 56:3763-3770), lentivirus, adeno-associated virus, DNA transfection 
(Ribas et aL, 1997, Cancer Res. 57:2865-2869), and tumor-derived RNA transfection (Ashley 
et al., 1997, J. Exp. Med. 186:1177-1182). Cells expressing 20P2H8 may also be engineered 

10 to express immune modulators, such as GM-CSF, and used as immunizing agents. 

Anti-idiotypic anti-20P2H8 antibodies can also be used in anti-cancer therapy as a 
vaccine for inducing an immune response to cells expressing a 20P2H8 protein. Specifically, 
the generation of anti-idiotypic antibodies is well known in the art and can readily be adapted to 
generate anti-idiotypic anti-20P2H8 antibodies that mimic an epitope on a 20P2H8 protein 

15 (see, for example, Wagner et aL, 1997, Hybridoma 16: 33-40; Foon et al, 1995, J. Clin. Invest 
96:334-342; Herlyn et al., 1996, Cancer Immunol. Immunother. 43:65-76). Such an anti- 
idiotypic antibody can be used in cancer vaccine strategies. 

Genetic immunization methods may be employed to generate prophylactic or 
therapeutic humoral and cellular immune responses directed against cancer cells expressing 

20 20P2H8. Constructs comprising DNA encoding a 20P2H8 protein/immunogen and 
appropriate regulatory sequences may be injected directly into muscle or skin of an individual, 
such that the cells of the muscle or skin take-up the construct and express the encoded 
20P2H8 protein/immunogen. Expression of the 20P2H8 protein immunogen results in the 
generation of prophylactic or therapeutic humoral and cellular immunity against bone, colon, 

25 pancreatic, testicular, cervical and ovarian cancers. Various prophylactic and therapeutic 
genetic immunization techniques known in the art may be used (for review, see information 
and references published at Internet address www.genweb.com). 

KITS 

30 For use in the diagnostic and therapeutic applications described or suggested above, 

kits are also provided by the invention. Such kits may comprise a carrier means being 
compartmentalized to receive in close confinement one or more container means such as 
vials, tubes, and the like, each of the container means comprising one of the separate 
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elements to be used in the method. For example, one of the container means may comprise 
a probe that is or can be detectably labeled. Such probe may be an antibody or 
polynucleotide specific for a 20P2H8 protein or a 20P2H8 gene or message, respectively. 
Where the kit utilizes nucleic acid hybridization to detect the target nucleic acid, the kit may 
also have containers containing nucleotide(s) for amplification of the target nucleic acid 
sequence and/or a container comprising a reporter-means, such as a biotin-binding protein, 
such as avidin or streptavidin, bound to a reporter molecule, such as an enzymatic, 
florescent, or radioisotope label. 

The kit of the invention will typically comprise the container described above and one 
or more other containers comprising materials desirable from a commercial and user 
standpoint, including buffers, diluents, filters, needles, syringes, and package inserts with 
instructions for use. A label may be present on the container to indicate that the composition 
is used for a specific therapy or non-therapeutic application, and may also indicate directions 
for either in vivo or in vitro use, such as those described above. 

EXAMPLES 

Various aspects of the invention are further described and illustrated by way of the 
several examples which follow, none of which are intended to limit the scope of the 
invention. 



EXAMPLE 1: 

SSH-GKNERATED I SOLATION OF cDNA FRAGMENT OF T HE 20P2H8 GENE 
MATERIALS AND METHODS: 
LAPC Xenografts and Human Tissues : 

25 LAPC xenografts were obtained from Dr. Charles Sawyers (UCLA) and generated as 

described (Klein et al, 1997, Nature Med. 3: 402-408). Androgen dependent and 
independent LAPC-4 xenografts LAPC-4 AD and AI, respectively) and LAPC-9 AD and AI 
xenografts were grown in male SCID mice and were passaged as small tissue chunks in 
recipient males. LAPC-4 and -9 AI xenografts were derived from LAPC-4 or -9 AD 

30 tumors, respectively. Male mice bearing AD tumors were castrated and maintained for 2-3 
months. After the tumors re-grew, the tumors were harvested and passaged in castrated 
males or in female SCID mice. Human tissues for RNA and protein analyses were obtained 
from the Human Tissue Resource Center (HTRC) at the UCLA (Los Angeles, CA) and 
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from QualTek, Inc. (Santa Barbara, CA). A benign prostatic hyperplasia tissue sample was 
patient-derived. 

Cell Lines : 

Human cell lines (e.g., HeLa) were obtained from the ATCC and were maintained 
in DMEM with 5% fetal calf serum. 

RNA Isolation : 

Tumor tissue and cell lines were homogenized in Trizol reagent (Life Technologies, 
Gibco BRL) using 10 ml/ g tissue or 10 ml/ 10 8 cells to isolate total RNA. Poly A RNA was 
purified from total RNA using Qiagen's Oligotex mRNA Mini and Midi kits. Total and 
mRNA were quantified by spectrophotometric analysis (O.D. 260/280 nm) and analyzed by 
gel electrophoresis. 

Oligonucleotides : 

The following HPLC purified oligonucleotides were used. 
DPNCDN (cDNA synthesis primer) : 
5TirrGATCAAGCTT 30 3' (SEQ ID NO: 3) 

Adaptor 1 : 

5 ? CTAATACGACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAG3 ? (SEQ ID 
NO: 4) 

3'GGCCCGTCCTAG5 ? (SEQ ID NO: 5) 

Adaptor 2 : 

5 ? GTAATACGACTCACTATAGGGCAGCGTGGTCGCGGCCGAG3 ? (SEQ ID 
NO: 6) 

3'CGGCTCCTAG5' (SEQ ID NO: 7) 

PCR primer 1 : 

5 ? CTAATACGACTCACTATAGGGC3 ? (SEQ ID NO: 8) 
Nested primer (NP^l : 

5TCGAGCGGCCGCCCGGGCAGGA3' (SEQ ID NO: 9) 
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Nested primer (NP^2 : 

5'AGCGTGGTCGCGGCCGAGGA3 5 (SEQ ID NO: 10) 

Su ppression Subtractive Hybridization : 

Suppression Subtractive Hybridization (SSH) was used to identify cDNAs 
corresponding to genes which may be differentially expressed in prostate cancer. The SSH 
reaction utilized cDNA from an androgen-dependent human prostate cancer xenograft 
originally derived from a prostate cancer lymph node metastasis (LAPC-4 AD) and cDNA 
derived from human benign hyperplasia (BPH) prostate tissue, wherein the LAPC-4 AD 
xenograft was used as the source of the "tester" cDNA, and the BPH tissue was used as the 
source of the "driver" cDNA. 

Double stranded cDNAs corresponding to tester and driver cDNAs were 
synthesized from 2 (ig of poly(A) + RNA isolated from the relevant xenograft tissue, as 
described above, using CLONTECH's PCR-Select cDNA Subtraction Kit and 1 ng of 
oligonucleotide DPNCDN as primer. First- and second-strand synthesis were carried out as 
described in the Kit's user manual protocol (CLONTECH Protocol No. PT1117-1, Catalog 
No. K1804-1). The resulting cDNA was digested with Dpn II for 3 hrs. at 37°C Digested 
cDNA was extracted with phenol/ chloroform (1:1) and ethanol precipitated. 

Driver cDNA was generated by combining Dpn II digested cDNA from BPH with 
a mix of digested cDNAs derived from the human cell lines HeLa, 293, A431, Colo205, and 
mouse liver. 

Tester cDNA was generated by diluting 1 |ll of Dpn II digested cDNA from the 
relevant xenograft source (see above) (400 ng) in 5 [il of water. The diluted cDNA (2 
160 ng) was then ligated to 2 jLil of Adaptor 1 and Adaptor 2 (10 jaM), in separate ligation 
reactions, in a total volume of 10 |il at 16°C overnight, using 400 u of T4 DNA ligase 
(CLONTECH). Ligation was terminated with 1 \xl of 0.2 M EDTA and heating at 72°C for 
5 min. 

The first hybridization was performed by adding 1.5 [il (600 ng) of driver cDNA to 
each of two tubes containing 1.5 jal (20 ng) Adaptor 1- and Adaptor 2- ligated tester cDNA. 
In a final volume of 4 |ul, the samples were overlaid with mineral oil, denatured in an MJ 
Research thermal cycler at 98°C for 1.5 minutes, and then were allowed to hybridize for 8 
hrs at 68°C. The two hybridizations were then mixed together with an additional 1 [il of 
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fresh denatuted driver cDNA and were allowed to hybridize overnight at 68°C. The second 
hybridization was then diluted in 200 \xl of 20 mM Hepes, pH 8.3, 50 mM NaCl, 0.2 mM 
EDTA, heated at 70°C for 7 min. and stored at -20°C. 

5 PCR Amplification. Cloning and Sequencing of Gene Fragments Generated from SSH : 

To amplify gene fragments resulting from SSH reactions, two PCR amplications 
were performed. In the primary PCR reaction 1 (il of the diluted final hybridization mix 
was added to 1 juil of PCR primer 1 (10 |uM), 0.5 jal dNTP mix (10 [iM) y 2.5 fil 10 x reaction 
buffer (CLONTECH) and 0.5 ul 50 x Advantage cDNA polymerase Mix (CLONTECH) in 

10 a final volume of 25 PCR 1 was conducted using the following conditions: 75°C for 5 
min., 94°C for 25 sec, then 27 cycles of 94°C for 10 sec, 66°C for 30 sec, 72°C for 1.5 min. 
Five separate primary PCR reactions were performed for each experiment. The products 
were pooled and diluted 1:10 with water. For the secondary PCR reaction, 1 pi from the 
pooled and diluted primary PCR reaction was added to the same reaction mix as used for 

15 PCR 1, except that primers NP1 and NP2 (10 \\M) were used instead of PCR primer 1. 
PCR 2 was performed using 10-12 cycles of 94°C for 10 sec, 68°C for 30 sec, 72°C for 1.5 
minutes. The PCR products were analyzed using 2% agarose gel electrophoresis. 

The PCR products were inserted into pCR2.1 using the T/A vector cloning kit 
(Invitrogen). Transformed E. coli were subjected to blue/white and ampicillin selection. 

20 White colonies were picked and arrayed into 96 well plates and were grown in liquid culture 
overnight To identify inserts, PCR amplification was performed on 1 ml of bacterial culture 
using the conditions of PCR1 and NP1 and NP2 as primers. PCR products were analyzed 
using 2% agarose gel electrophoresis. 

Bacterial clones were stored in 20% glycerol in a 96 well format. Plasmid DNA was 

25 prepared, sequenced, and subjected to nucleic acid homology searches of the GenBank, 
dBest, and NCI-CGAP databases. 

RT-PCR Expression Analysis : 

First strand cDNAs were generated from 1 (ig of mRNA with oligo (dT)12-18 
30 priming using the Gibco-BRL Superscript Preamplification system. The manufacturers 
protocol was used and included an incubation for 50 min at 42°C with reverse transcriptase 
followed by RNAse H treatment at 37°C for 20 min. After completing the reaction, the 
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volume was increased to 200 |il with water prior to normalization. First strand cDNAs 
from 16 different normal human tissues were obtained from Clontech. 

Normalization of the first strand cDNAs from multiple tissues was performed by 
using the primers 5'atatcgccgcgctcgtcgtcgacaa3' (SEQ ID NO: 11) and 

5 5'agccacacgcagctcattgtagaagg 3' (SEQ ID NO: 12) to amplify (3-actin. First strand cDNA (5 
pi) was amplified in a total volume of 50 jLxl containing 0.4 uM primers, 0.2 \xM each 
dNTPs, 1XPCR buffer (Clontech, 10 mM Tris-HCL, 1.5 mM MgCl 2 , 50 mM KC1, pH8.3) 
and IX Klentaq DNA polymerase (Clontech). Five (ixl of the PCR reaction was removed at 
18, 20, and 22 cycles and used for agarose gel electrophoresis. PCR was performed using 

10 an MJ Research thermal cycler under the following conditions: initial denaturation was at 
94°C for 15 sec, followed by a 18, 20, and 22 cycles of 94°C for 15, 65°C for 2 min, 72°C 
for 5 sec. A final extension at 72°C was carried out for 2 min. After agarose gel 
electrophoresis, the band intensities of the 283 bp P-actin bands from multiple tissues were 
compared by visual inspection. Dilution factors for the first strand cDNAs were calculated 

15 to result in equal (3-actin band intensities in all tissues after 22 cycles of PCR. Three rounds 
of normalization were required to achieve equal band intensities in all tissues after 22 cycles 
of PCR. 

To determine expression levels of the 20P2H8 gene, 5 (ml of normalized first strand 
cDNA was analyzed by PCR using 25, 30, and 35 cycles of amplification using the following 
20 primer pairs: 

5? _ TQA AAC CTC CAG ACA CAA GAA _ y ( SE q ID NO: 13 ) 
5' - GGA GAT GGT AGA CAC TGG TGG AGT - 3' (SEQ ID NO: 14) 

25 Semi quantitative expression analysis was achieved by comparing the PCR products at 

cycle numbers that give light band intensities. 

RESULTS : 

The SSH experiment described in the Materials and Methods, supra, led to the 
30 isolation of numerous candidate gene fragment clones (SSH clones). All candidate clones 
were sequenced and subjected to homology analysis against all sequences in the major public 
gene and EST databases in order to provide information on the identity of the 
corresponding gene and to help guide the decision to analyze a particular gene for 
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differential expression. In general, gene fragments which had no homology to any known 
sequence in any of the searched databases, and thus considered to represent novel genes, as 
well as gene fragments showing homology to previously sequenced expressed sequence tags 
(ESTs), were subjected to differential expression analysis by RT-PCR and/or Northern 
5 analysis. 

An SHH clone of 442 bp exhibiting significant homology to ESTs derived from 
several libraries, including libraries made from fetal liver, placenta, and a pool of lung, testis 
and B cells was isolated and designated 20P2H8. Initial expression analysis by RT-PCR 
showed that 20P2H8 is expressed in prostate and all LAPC xenografts (FIG. 2). 
10 Additionally, RT-PCR analysis of first strand cDNA derived from 16 normal tissues showed 
expression primarily in prostate, pancreas, colon and placenta after 25 cycles of 
amplification (FIG. 2). Expression was detected in other tissues after 30 cycles of 
amplification (FIG. 2). This clone, therefore, was utilized for obtaining a full length cDNA 
encoding 20P2H8 as described in Example 2, below. 

15 

EXAMPLE 2 

ISOLATION OF FULL LENGTH cDNA ENCODING THE 2QP2H8 GENE 

The isolated 20P2H8 gene fragment of 442 bp was used as a probe to identify the 
full length cDNA for 20P2H8 in a human prostate cDNA library. This resulted in the 

20 isolation of a 3600 bp cDNA, clone 20P2H8-GTC2, which encodes a 517 amino acid open 
reading frame with homology to heterogenous nuclear ribonucleoproteins (hnRNPs). The 
nucleotide and deduced amino acid sequences encoded by this cDNA are shown in FIG. 1. 
The highest homology is with a protein identified in C. elegans (CAA92704). Significant 
homology is also seen with hnRNPs involved in mRNA splicing (ROF, ROH1 and ROH2). 

25 20P2H8 exhibits five RNA binding sequences, two corresponding to ribonucleoprotein-1 
(RNP1) consensus sites and three corresponding to RNP2 sites (designated in FIG. 1). In 
addition, 20P2H8 contains three regions with significant proline content (30-42%), which lie 
outside regions of homology with hnRNPs (designated in FIG. 1). 

30 EXAMPLE 3 

NORTHERN BLOT ANALYSIS OF 20P2H8 GENE EXPRESSION 

20P2H8 mRNA expression in normal human tissues was first analyzed by Northern 
blotting two multiple tissue blots obtained from Clontech (Palo Alto, California), 
comprising a total of 16 different normal human tissues, using labeled 20P2H8 cDNA as a 
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probe. RNA samples were quantitatively normalized with a P-actin probe. The results are 
shown in FIG. 3, Panels A and B, and indicate that, within the 16 tissues tested, the 20P2H8 
gene is predominantly expressed as a single 4.4 kb transcript in pancreas, with lower level 
expression detected in prostate and colon, and lower level expression in placenta (FIG. 3; 
Panels A and B). No other normal tissues in the panel showed detectable expression. 

In addition, in order to analyze 20P2H8 expression in human cancer tissues and cell 
lines, RNAs derived from LAPC human prostate cancer xenografts and several cancer cell 
lines were analyzed by Northern blot using the 20P2H8 cDNA as probe. All RNA samples 
were quantitatively normalized by ethidium bromide staining and subsequent analysis with a 
labeled P-actin probe. The results of this analysis are presented in FIG. 3, Panel C (LAPC 
xenografts) and FIG. 4 (colon and pancreatic cancer cell lines). The results show up- 
regulated expression of 20P2H8 in LAPC-9 and LAPC-4 xenografts compared to normal 
prostate, with highest levels detected in LAPC-4 AD and LAPC-4 AI (FIG. 3, Panel C) as 
well as high levels of expression in several pancreatic (BxPC-3, HPAC, Capan-1) and colon 
(CaCo-2, T84, Colo-205) cancer cell lines (FIG. 4), suggesting that 20P2H8 is a gene that is 
up-regulated in prostate, pancreatic and colon cancers. 



EXAMPLE 4: 

PRODUCTION OF RECOMBINANT 20P2H8 

To express recombinant 20P2H8 for use in a number of contexts such as analyzing 
the subcellular localization of 20P2H8 protein, a partial or the full length cDNA can cloned 
into any one of a variety of expression vectors such as those that provide a 6His tag at the 
carboxyl-terminus (e.g. pCDNA 3.1 myc-his, InVitrogen). 

In a typical embodiment, an expression vectors construct with a 6His tag at the 
carboxyl-terminus is transfected into 293T cells which are then labeled for one hour with 
35 S-methionine. The cells are then washed and incubated in non-radioactive media to chase 
the labeled proteins for various time points. 20P2H8-His tagged protein is then 
immunoprecipitated using anti-His antibodies (Santa Cruz) from cell extracts and from cell 
supernatant (media) at various time points after the chase. The immunoprecipitates are 
analyzed by SDS-PAGE with subsequent autoradiography to visualize 35 S-methionine 
labeled protein. 

Additional embodiments of typical constructs are provided below. 
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Protein Expression 

In an illustrative embodiment describing the production of recombinant 
20P2H8, 20P2H8 cDNA was cloned into the mammalian expression vector pCDNA 3.1 
(Invitrogen) that contains a 6X His COOH-terminal epitope tag that allows protein 

5 expression analysis using an anti-His pAb reagent. This construct was used to transfect 
293T human embryonic kidney cells to assess 20P2H8 protein expression. As seen in Fig 7 
(panel A), an anti-His immunoreactive band of approximately 60 kD is seen in 293T cells 
transiently transfected with the 20P2H8 vector but not in cells transfected with a control 
empty vector. The molecular weight corresponds to the predicted 57 kD molecular weight 

10 of 20P2H8 cDNA plus the additional amino acids coded by the His and Myc epitope tags. 
Western analysis of Colo 205 cells, a 20P2H8 mRNA positive cell line, with an anti-20P2H8 
polyclonal antibody demonstrates immunoreactive bands of approximately 58 kD and 30 
kD that are not seen in 20P2H8 mRNA negative 293T cells, which is indicative of 
endogenous 20P2H8 protein expression (Fig 7, panel B). 

15 Additional constructs for recombinant expression are provided below. 

pGEX Constructs 

To express 20P2H8 in bacterial cells, a portion of 20P2H8 was fused to the 
Glutathione S-transferase (GST) gene by cloning into pGEX-6P-l (Amersham Pharmacia 
Biotech, NJ). All constructs were made to generate recombinant 20P2H8 protein sequences 

20 with GST fused at the N-terminus and a six histidine epitope at the C-terminus. The six 
histidine epitope tag was generated by adding the histidine codons to the cloning primer at 
the y end of the ORF. A PreScission™ recognition site permits cleavage of the GST tag 
from 20P2H8. The ampicillin resistance gene and pBR322 origin permits selection and 
maintenance of the plasmid in E. coli. In a specific illustrative embodiment, the following 

25 fragment of 20P2H8 was cloned into pGEX-6P-l: Amino acids 1 to 126 (SEQ ID NO: 2). 

Additional constructs can be made in pGEX-6P-l spanning any one of a variety of 
regions within the 20P2H8 protein including, for example a region containing one or more 
of the specified biological motifs discussed above or the following regions of the 20P2H8 
protein: amino acids 1 to 517; amino acids 126 to 252 and; amino acids 252 to 517 (SEQ ID 

30 NO: 2). 

pMAL Constructs 

To express 20P2H8in bacterial cells, portions of 20P2H8 can be fused to the 
maltose-binding protein (MBP) gene by cloning into pMAL-c2X and pMAL-p2X (New 
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England Biolabs, MA). All constructs can be made to generate recombinant 20P2H8 
protein sequences with MBP fused at the N-terminus and a six histidine epitope at the C- 
terminus. The six histidine epitope tag was generated by adding the histidine codons to the 
y cloning primer. A Factor Xa recognition site permits cleavage of the GST tag from 

5 20P2H8. The pMAL-c2X and pMAL-p2X vectors are optimized to express the 
recombinant protein in the cytoplasm or periplasm respectively. Periplasm expression 
enhances folding of proteins with disulfide bonds. These constructs can be made in pMAL- 
c2X and pMAL-p2X and span the following regions of the 20P2H8 protein: amino acids 1 
to 126; amino acids 1 to 517; amino acids 126 to 252; amino acids 252 to 517 (SEQ ID NO: 

10 2). 

pcDNA3.1/MvcHis Construct 

To express 20P2H8 in mammalian cells, the 1551 bp (517 amino acid) 20P2H8 ORF 
was cloned into pcDNA3.1/MycHis_Version A (Invitrogen, Carlsbad, CA). Protein 

15 expression is driven from the cytomegalovirus (CMV) promoter. The recombinant protein 
has the myc and six histidines fused to the C-terminus. The pcDNA3.1/MycHis vector also 
contains the bovine growth hormone (BGH) polyadenylation signal and transcription 
termination sequence to enhance mRNA stability along with the SV40 origin for episomal 
replication and simple vector rescue in cell lines expressing the large T antigen. The 

20 Neomycin resistance gene allows for selection of mammalian cells expressing the protein 
and the ampicillin resistance gene and ColEl origin permits selection and maintenance of 
the plasmid in E. coli. 

pSRa Constructs 

25 To generate mammalian cell lines expressing 20P2H8 constitutively, the 1551 bp 

(517 amino acid) ORF was cloned into pSRa constructs. Amphotropic and ecotropic 
retroviruses are generated by transfection of pSRa constructs into the 293T-10A1 packaging 
line or co-transfection of pSRa and a helper plasmid (cpD ) in 293 cells, respectively. The 
retrovirus can be used to infect a variety of mammalian cell lines, resulting in the integration 

30 of the cloned gene, 20P2H8, into the host cell-lines. Protein expression is driven from a 
long terminal repeat (LTR). The Neomycin resistance gene allows for selection of 
mammalian cells expressing the protein and the ampicillin resistance gene and ColEl origin 
permits selection and maintenance of the plasmid in E. coli. An additional pSRa construct 
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was made that fused the FLAG tag to the C-terminus to allow detection using anti-FLAG 
antibodies. The FLAG sequence 5' gat tac aag gat gac gac gat aag 3 ? were added to cloning 
primer at the 3' end of the ORF. 

Additional pSRa constructs can be made to produce both N-terrninal and C-terminal 
5 GFP and myc/6 HIS fusion proteins of the full length 20P2H8 protein. 

EXAMPLE 5: 

PRODUCTION OF RECOMBINANT 20P2H8 IN A BACULOVIRUS SYSTEM 

To generate recombinant 20P2H8 protein in a baculovirus expression system, 
10 20P2H8 cDNA is cloned into the baculovirus transfer vector pBlueBac 4.5 (Invitrogen) 
which provides a His-tag at the N-terminus. Specifically, pBlueBac-20P2H8 is co- 
transfected with helper plasmid pBac-N-Blue (Invitrogen) into SF9 (Spodoptera frugiperda) 
insect cells to generate recombinant baculovirus (see Invitrogen instruction manual for 
details). Baculovirus is then collected from cell supernatant and purified by plaque assay. 
15 Recombinant 20P2H8 protein is then generated by infection of HighFive insect cells 

(InVitrogen) with the purified baculovirus. Recombinant 20P2H8 protein may be detected 
using 20P2H8-specific antibody. 20P2H8 protein may be purified and used in various cell 
based assays or as immunogen to generate polyclonal and monoclonal antibodies specific 
for 20P2H8. 

20 

EXAMPLE 6: 

GENERATION OF 20P2H8 ANTIBODIES 
Generation of Monoclonal antibodies (mAbs^ 

In a typical method of generating 20P2H8 monoclonal antibodies, a glutathione-S- 
25 transferase (GST) fusion protein encompassing the 20P2H8 protein is synthesized and used 
as immunogen. Balb C mice are initially immunized intraperitoneally with 200 |Lig of the 
GST-20P2H8 fusion protein mixed in complete Freund's adjuvant. Mice are subsequendy 
immunized every 2 weeks with 75 Lig of GST-20P2H8 protein mixed in Freund's 
incomplete adjuvant for a total of 3 immunizations. Reactivity of serum from immunized 
30 mice to full length 20P2H8 protein is monitored by ELISA using a partially purified 
preparation of HIS-tagged 20P2H8 protein expressed from 293T cells. Mice showing the 
strongest reactivity are rested for 3 weeks and given a final injection of fusion protein in 
PBS and then sacrificed 4 days later. The spleens of the sacrificed mice are then harvested 
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and fused to SPO/2 myeloma cells using standard procedures (Harlow and Lane, 1988). 
Supernatants from growth wells following HAT selection are screened by ELISA and 
Western blot to identify 20P2H8 specific antibody producing clones. 

The binding affinity of a 20P2H8 monoclonal antibody may be determined using 

5 standard technology. Affinity measurements quantify the strength of antibody to epitope 
binding and may be used to help define which 20P2H8 monoclonal antibodies are preferred 
for diagnostic or therapeutic use. The BIAcore system (Uppsala, Sweden) is a preferred 
method for determining binding affinity. The BIAcore system uses surface plasmon 
resonance (SPR, Welford K. 1991, Opt. Quant Elect. 23:1; Morton and Myszka, 1998, 

10 Methods in Enzymology 295: 268) to monitor biomolecular interactions in real time. 
BIAcore analysis conveniently generates association rate constants, dissociation rate 
constants, equilibrium dissociation constants, and affinity constants. 



Generation of polyclonal antibodies (pAbs) 

15 To generate polyclonal sera to 20P2H8, a peptide was synthesized corresponding to 

amino acids 224-237 (QNALRKHKDLLGKR) of 20P2H8 protein sequence. The peptide 
was coupled to Keyhole limpet hemacyanin (KLH) and used to immunize a rabbit as 
follows. The rabbit was initially immunized with 200 ug of peptide-KLH mixed in complete 
Freund's adjuvant The rabbit was then injected every two weeks with 200 ug of peptide- 

20 KLH in incomplete Freund's adjuvant. Bleeds were taken approximately 7-10 days following 
each immunization. ELISA and Western blotting analyses were used to determine 
specificity and titer of the rabbit serum to the immunizing peptide and the 20P2H8 protein 
respectively. Affinity purified 20P2H8 polyclonal antibodies were prepared by passage of 
crude serum from immunized rabbit over an affinity matrix comprised of 20P2H8 peptide 

25 covalently coupled to Affigel 10 (BioRad). After extensive washing of the matrix with PBS, 
antibodies specific to 20P2H8 peptide were eluted with low pH glycine buffer (0,1M, pH 
2.5). Western blotting reveals the appearance of novel anti-20P2H8 immunoreactive bands 
of approximately 58 kD and 30 kD in 20P2H8 mRNA-expressing COLO 205 cells but not 
in 20P2H8 mRNA-negative 293T cells (Figure 7, panel B). 

30 

EXAMPLE 7: 

IDENTIFICATION OF POTENTIAL SIGNAL TRANSDUCTION PATHWAYS 

To determine whether 20P2H8 directly or indirectly activates known signal transduction 
pathways in cells, luciferase Que) based transcriptional reporter assays are carried out in cells 
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expressing 20P2H8. These transcriptional reporters contain consensus binding sites for 
known transcription factors which lie downstream of well characterised signal transduction 
pathways. The reporters and examples of there associated transcription factors, signal 
transduction pathways, and activation stimuli are listed below. 

5 

1. NFkB-luc, NFkB/Rel; Ik-kinase/SAPK; growth/apoptosis/stress 

2. SRE-luc, SRF / TCF/ELK1 ; MAPK/SAPK; growth/differentiation 

3. AP-l-luc, FOS/JUN; MAPK/SAPK/PKC; growth/apoptosis/stress 

4. ARE-luc, androgen receptor; steroids/MAPK; growth/differentiation/apoptosis 
10 5. p53-luc, p53; SAPK; growth/differentiation/apoptosis 

6. CRE-luc, CREB/ATF2; PKA/p38; growth/apoptosis/stress 

20P2H8-mediated effects may be assayed in cells showing mRNA expression, such 
as the 20P2H8-expressing cancer cell lines shown in FIG. 4. Luciferase reporter plasmids 

15 may be introduced by lipid mediated transfection (TFX-50, Promega). Luciferase activity, 
an indicator of relative transcriptional activity, is measured by incubation of cells extracts 
with luciferin substrate and luminescence of the reaction is monitored in a luminometer. 

The transcriptional activity of 20P2H8 may be confirmed using electromobility shift 
assays (EMSA). Cells expressing 20P2H8 will be evaluated for their ability to bind known 

20 response elements and compared to control 20P2H8 negative cells. Whole cell and nuclear 
extracts will be used in this assay, and will be analyzed in the presence and absence of 
potential (i) 20P2H8 inhibitors and (ii) 20P2H8 interacting proteins. These assays will 
provide us with valuable information regarding gene candidates and biologic pathways 
regulated by 20P2H8 and the mechanism by which 20P2H8 controls gene expression. 

25 

EXAMPLE 8: 

IN VITRO ASSAYS FOR CHARACTERIZING 20P2H8 

Subcellular Locali2ation of 20P2H8. Sequence analysis of 20P2H8 revealed the 
presence of RNP like domains and suggests that 20P2H8 may have an RNA splicing 
30 function. This, in turn, indicates that 20P2H8 may have a nuclear localization. The cellular 
location of 20P2H8 can be assessed using subcellular fractionation techniques widely used in 
cellular biology (Storrie B, et al. Methods Enzymol. 1990;182:203-25). Prostate, colon or 
pancreatic cells can be lysed and separated into nuclear, cytosolic and membrane fractions. 
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The expression of 20P2H8 in the different fractions can be tested using western blotting 
techniques. 

EXAMPLE 9: 

5 In Vitro Assays Of 20P2H8 Function. 20P2H8 function can be assessed in 

mammalian cells using in vitro approaches. For mammalian expression, 20P2H8 can be 
cloned into a number of appropriate vectors, including pcDNA 3.1 myc-His-tag and the 
retroviral vector pSRottkneo (Muller et aL, 1991, MCB 11:1785). Using such expression 
vectors, 20P2H8 can be expressed in several cancer cell lines, including for example PC-3, 

10 NIH 3T3, LNCaP and 293T. Expression of 20P2H8 can be monitored using anti-20P2H8 
antibodies. 

Mammalian cell lines expressing 20P2H8 can be tested in several in vitro and in vivo 
assays, including cell proliferation in tissue culture, activation of apoptotic signals, primary 
and metastatic tumor formation in SCID mice, and in vitro invasion using a membrane 

15 invasion culture system (MICS) (Welch et al. ,Int. J. Cancer 43: 449-457). 20P2H8 cell 
phenotype is compared to the phenotype of cells that lack expression of 20P2H8. 

Cell lines expressing 20P2H8 can also be assayed for alteration of invasive and 
migratory properties by measuring passage of cells through a matrigel coated porous 
membrane chamber (Becton Dickinson). Passage of cells through the membrane to the 

20 opposite side is monitored using a fluorescent assay (Becton Dickinson Technical Bulletin 
#428) using calcein-Am (Molecular Probes) loaded indicator cells. Cell lines analyzed 
include parental and 20P2H8 overexpressing PC3, 3T3 and LNCaP cells. To assay whether 
20P2H8 has chemoattractant properties, parental indicator cells are monitored for passage 
through the porous membrane toward a gradient of 20P2H8 conditioned media compared 

25 to control media. This assay may also be used to qualify and quantify specific neutralization 
of the 20P2H8 induced effect by candidate cancer therapeutic compositions. 

The function of 20P2H8 can be evaluated using anti-sense RNA technology coupled 
to the various functional assays described within this section, e.g. growth, invasion and 
migration. Anti-sense RNA oligonucleotides can be introduced into 20P2H8 expressing 

30 cells, thereby preventing the expression of 20P2H8. Control and anti-sense containing cells 
can be analyzed for proliferation, invasion, migration, apoptotic and transcriptional 
potential. The local as well as systemic effect of the loss of 20P2H8 expression can be 
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evaluated. In addition to confirming the function of 20P2H8, anti-sense oligonucleotides 
may be used as a therapeutic agent. 

EXAMPLE 10: 

5 IN VIVO ASSAY FOR 20P2H8 TUMOR GROWTH PROMOTION 

The effect of the 20P2H8 protein on tumor cell growth may be evaluated in vivo by 
gene overexpression in tumor-bearing mice. For example, SCID mice can be injected SQ 
on each flank with 1 x 10 6 of a number of prostate, colon or pancreatic cell lines containing 
tkNeo empty vector or 20P2H8. At least two strategies may be used: (1) Constitutive 

10 20P2H8 expression under regulation of an LTR promoter, and (2) Regulated expression 
under control of an inducible vector system, such as ecdysone, tet, etc. Tumor volume is 
then monitored at the appearance of palpable tumors and followed over time to determine 
if 20P2H8 expressing cells grow at a faster rate. Additionally, mice may be implanted with 1 
x 10 5 of the same cells orthotopically to determine if 20P2H8 has an effect on local growth 

15 in the target tissue (i.e., prostate) or on the ability of the cells to metastasize, specifically to 
lungs, lymph nodes, liver, bone marrow, etc. In relation to prostate cancer, the effect of 
20P2H8 on bone tumor formation and growth may be assessed by injecting prostate tumor 
cells intratibially, as described in W098/16628. 

These assays are also useful to determine the 20P2H8 inhibitory effect of candidate 

20 therapeutic compositions, such as for example, 20P2H8 antibodies, 20P2H8 antisense 
molecules and ribozymes. 

EXAMPLE 11: IN VITRO ASSAY OF 20P2H8 PROTEIN INTERACTION. 

Cell lines expressing 20P2H8 can also be used to identify protein-protein 
25 interactions mediated by 20P2H8. The presence of proline-rich regions and homology to 

RNP in the ORF suggest that 20P2H8 may interact with other RNPs. This possibility can be 

examined using immunoprecipitation techniques as shown by others (Hamilton BJ, et al. 

Biochem. Biophys. Res. Commun. 1999, 261:646-51). 20P2H8 protein can be 

immunoprecipitated from 20P2H8 expressing prostate cancer cell lines and examined for 
30 protein association by western blotting. Protein interaction may be also studied by a two 

yeast hybrid system, as described by Shnyreva et. Al. (Shnyreva M et al, J Biol Chem. 2000. 

19;275 ;1 5498-503). These assays may also be used to analyze the effect of potential cancer 

therapeutics on 20P2H8 function. 
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To determine the contribution of the various domains contained within the 20P2H8 
ORF to 20P2H8 function, 20P2H8 mutants can be generated lacking one or more domains. 
Cell lines expressing mutant 20P2H8 protein will be evaluated for alteration in proliferation, 
invasion, migration, transcriptional activation and protein-protein interaction. 

5 

Example 12: Chromosomal Localization of 20P2H8 

The chromosomal localization of 20P2H8 was determined using the GeneBridge4 
radiation hybrid panel (Walter et al., 1994, Nat. Genetics 7:22) (Research Genetics, 
Huntsville Al). The following PCR primers were used to localize 20P2H8: 
10 20P2H8.5 tcttgaaacctccagacacaagaa (SEQIDNO:15) 
20P2H8.6 aagttacgatttggcttcactgg (SEQ ID NO: 16) 

The resulting mapping vector for the 93 radiation hybrid panel DNAs was: 
0000001001001010000200001101000000010100101011100010000101011010000210111000 
15 00001101010000001 

This the mapping program which can be found at internet address http:/ /www- 
genome. wi.mitedu/cgi-bin/contig/rhmapper.pl maps 85P1B3 to chromosome 8q22.2- 
23.1 / 15q22.32-23. As HTGS hits show PACs on chromosome 15 therefore this is the 
20 most likely mapping location. 



Throughout this application, various publications are referenced. The disclosures of 
these publications are hereby incorporated by reference herein in their entireties. 

25 The present invention is not to be limited in scope by the embodiments disclosed 

herein, which are intended as single illustrations of individual aspects of the invention, and 
any which are functionally equivalent are within the scope of the invention. Various 
modifications to the models and methods of the invention, in addition to those described 
herein, will become apparent to those skilled in the art from the foregoing description and 

30 teachings, and are similarly intended to fall within the scope of the invention. Such 
modifications or other embodiments can be practiced without departing from the true scope 
and spirit of the invention. 
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TABLES 

TABLE 1 



Rank 


Start 
Position 


Subsequence Residue Listing 


Score (Estimate of half time 
of disassociation) 


1 


304 


FLGEFATDI (SEQ ID NO: 17) 


747.7 


2 


215 


VLFACEEYA (SEQ ID NO: 
18) 


84.2 


3 


233 


LLGKRYIEL (SEQ ID NO: 
19) 


54.5 


4 


275 


QQFVPPTNV (SEQ ID NO: 
20) 


26.1 


5 


363 


CSAEEMNFV (SEQ ID NO: 
21) 


23.6 


6 


159 


FLSKENQVI (SEQ ID NO: 22) 


19.7 


7 


450 


SLGYFPTAA (SEQ ID NO: 
23) 


18.9 


8 


509 


TLPKEWVCI (SEQ ID NO: 
24) 


17.7 


9 


393 


YTFPAPAAV (SEQ ID NO: 
25) 


16.4 


10 


492 


YQYATEDGL (SEQ ID NO: 
26) 


15.6 
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CLAIMS: 

1 . A polynucleotide that encodes a 20P2H8 polypeptide, wherein the polynucleotide is 
selected from the group consisting of: 

5 (a) a polynucleotide having the sequence as shown in FIG. 1 (SEQ ID NO: 1), 

wherein T can also be U; 

(b) a polynucleotide having the sequence as shown in FIG. 1 (SEQ ID NO: 1), 
from about nucleotide residue number 450 through about nucleotide residue 
number 2000, wherein T can also be U; 

10 (c) a polynucleotide encoding a 20P2H8 polypeptide whose sequence is 

encoded by the cDNA contained in the plasmid deposited with American 
Type Culture Collection as Accession No. 207151; 

(d) a polynucleotide encoding a 20P2H8 protein having the amino acid sequence 
shown in FIG. 1 (SEQ ID NO: 2); and 

15 (e) a polynucleotide that is fully complementary to a polynucleotide of any one 

of(a)-(d). 

2. A polynucleotide that encodes a polypeptide that is at least 90% identical to the 
amino acid sequence shown in FIG. 1 (SEQ ID NO: 2) over its entire length. 

3. A fragment of a polynucleotide of claim 1 comprising: 

20 (a) a polynucleotide having the sequence as shown in FIG. 1 (SEQ ID NO: 1), 

from nucleotide residue number 600 through about nucleotide residue 
number 3600; 

(b) a polynucleotide that is a fragment of the polynucleotide of (a) that is at least 
20 nucleotide bases in length; or 

25 a polynucleotide that selectively hybridizes under stringent conditions to the polynucleotide 
of (a) or (b). 
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4. A polynucleotide that encodes a 20P2H8 polypeptide, wherein the polypeptide 
includes an amino acid sequence selected from the group consisting of NYTA at 
residues 436-439 (SEQ ID NO: 2), NLSG at residues 459-462 (SEQ ID NO: 2), 
SKME at residues 56-59 (SEQ ID NO: 2), SDQD at residues 77-80 (SEQ ID NO: 

5 2), TGED at residues 141-144 (SEQ ID NO: 2), TSNE at residues 152-155 (SEQ 

ID NO: 2), TAEE at residues 178-181 (SEQ ID NO: 2), TYPD at residues 203-206 
(SEQ ID NO: 2), TAAE at residues 245-248 (SEQ ID NO: 2), TIED at residues 
297-300 (SEQ ID NO: 2), SAEE at residues 364-367 (SEQ ID NO: 2), GLNIAK at 
residues 87-92 (SEQ ID NO: 2), GAALCL at residues 94-99 (SEQ ID NO: 2), 

10 GGTSNE at residues 150-155 (SEQ ID NO: 2), GLPFTA at residues 172-177 

(SEQ ID NO: 2), GGKEGI at residues 194-199 (SEQ ID NO: 2), GLPYAA at 
residues 291-296 (SEQ ID NO: 2), GGTLNR at residues 374-379 (SEQ ID NO: 2), 
GSPNSL at residues 446-451 (SEQ ID NO: 2), GLAYNT at residues 475-480 
(SEQ ID NO: 2), GLIHTN at residues 499-504 (SEQ ID NO: 2), QGRR at 

15 residues 102-105 (SEQ ID NO: 2), LGKR at residues 234-237 (SEQ ID NO: 2), 

FLGEFATDI (SEQ ID NO: 17), VLFACEEYA (SEQ ID NO: 18), LLGKRYIEL 
(SEQ ID NO: 19), QQFVPPTNV (SEQ ID NO: 20), CSAEEMNFV (SEQ ID 
NO: 21), FLSKENQVI (SEQ ID NO: 22), SLGYFPTAA (SEQ ID NO: 23), 
TLPKEWVCI (SEQ ID NO: 24), YTFPAPAAV (SEQ ID NO: 25) and 

20 YQYATEDGL (SEQ ID NO: 26). 

5. A polynucleotide of claim 1 that is labeled with a detectable marker. 

6. A recombinant expression vector that contains a polynucleotide of claim 1 . 

7. A host cell that contains an expression vector of claim 6. 

8. A process for producing a 20P2H8 polypeptide comprising culturing a host cell of 
25 claim 7 under conditions sufficient for the production of the polypeptide. 

9. The process of claim 8, further comprising recovering the 20P2H8 polypeptide so 
produced. 

10. A 20P2H8 polypeptide produced by the process of claim 8. 

11. A 20P2H8 polypeptide encoded by the polynucleotide of claim 1 . 
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12. A polypeptide comprising at least 15 contiguous amino acids of the polypeptide of 
claim 1 1 . 

13. An antibody or fragment thereof that specifically binds to the 20P2H8 polypeptide 
of claim 11. 

5 14. The antibody or fragment thereof of claim 13, which is monoclonal. 

15. A recombinant protein comprising the antigen binding region of a monoclonal 
antibody of claim 14. 

16. The antibody or fragment thereof of claim 13, which is labeled with a detectable 
marker. 

10 17. The antibody or fragment thereof of claim 16, wherein the detectable marker is 
selected from the group consisting of a radioisotope, fluorescent compound, 
bioluminescent compound, chemiluminescent compound, metal chelator or enzyme. 

18. The antibody fragment of claim 13, which is an Fab, F(ab')2, Fv or sFv fragment. 

19. The antibody or fragment thereof of claim 13, which is a human antibody. 

15 20. The antibody or fragment thereof of claim 13, which is conjugated to a toxin or a 
therapeutic agent. 

21. The antibody of claim 13, which comprises murine antigen binding region residues 
and human antibody residues. 

22. A transgenic animal producing a monoclonal antibody of claim 14. 
20 23. A hybridoma producing a monoclonal antibody of claim 14. 

24. A single chain monoclonal antibody that comprises the variable domains of the 
heavy and light chains of a monoclonal antibody of claim 14. 

25. A vector comprising a polynucleotide encoding a single chain monoclonal antibody 
of claim 24. 

25 26. An assay for detecting the presence of a 20P2H8 protein in a biological sample 
comprising contacting the sample with an antibody or fragment thereof or 
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recombinant protein of claim 16, and detecting the binding of 20P2H8 protein in 
the sample thereto. 

An assay for detecting the presence of a 20P2H8 polynucleotide in a biological 
sample, comprising 

(a) contacting the sample with a polynucleotide probe that specifically 
hybridizes to the polynucleotide of claim 1; and 

(b) detecting the presence of a hybridization complex formed by the 
hybridization of the probe with 20P2H8 polynucleotide in the sample, 
wherein the presence of the hybridization complex indicates the presence of 
20P2H8 polynucleotide within the sample. 

An assay for detecting the presence of 20P2H8 mRNA in a biological sample 
comprising: 

(a) producing cDNA from the sample by reverse transcription using at least one 
primer; 

(b) amplifying the cDNA so produced using 20P2H8 polynucleotides as sense 
and antisense primers to amplify 20P2H8 cDNAs therein; 

(c) detecting the presence of the amplified 20P2H8 cDNA, 

wherein the 20P2H8 polynucleotides used as the sense and antisense probes are 
capable of amplifying the 20P2H8 cDNA contained within the plasmid deposited 
with American Type Culture Collection as Accession No. 207151. 

A method of examining a biological sample for evidence of dysregulated cellular 
growth comprising comparing the status of 20P2H8 in the biological sample to the 
status of 20P2H8 in a corresponding normal sample, wherein alterations in the 
status of 20P2H8 in the biological sample are associated with dysregulated cellular 
growth. 

A method of identifying evidence of a neoplasm in a biological sample comprising: 
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(a) examining a level of 20P2H8 gene expression in a test biological sample; 



and 



(b) comparing the level of 20P2H8 gene expression in the test biological 
sample to a level of 20P2H8 gene expression found in a comparable normal 



10 31. 
15 

32. 



20 

33. 



biological sample, 

wherein differences in the level of 20P2H8 gene products in the test 
biological sample relative to the normal biological sample are associated with 
the neoplasm. 

A method of monitoring 20P2H8 gene products comprising determining the status 
of 20P2H8 gene products expressed by cells in a test tissue sample from an 
individual and comparing the status so determined to the status of 20P2H8 gene 
products in a corresponding normal sample, the presence of aberrant 20P2H8 gene 
products in the test sample relative to the normal sample providing an indication of 
dysregulated cell growth within the individual. 

A method of diagnosing the presence of cancer in an individual comprising: 

(a) determining the level of 20P2H8 mRNA expressed in a test sample obtained 
from the individual; and 

(b) comparing the level so determined to the level of 20P2H8 mRNA expressed 
in a comparable known normal tissue sample, 

the presence of elevated 20P2H8 mRNA expression in the test sample relative to the 
normal tissue sample providing an indication of the presence of cancer. 

A method of diagnosing the presence of cancer in an individual comprising: 

(a) determining the level of 20P2H8 protein expressed in a test sample obtained 
from the individual; and 

(b) comparing the level so determined to the level of 20P2H8 protein expressed 
in a comparable known normal tissue sample, 

the presence of elevated 20P2H8 protein in the test sample relative to the normal 
tissue sample providing an indication of the presence of cancer. 
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34. The method of claim 32, wherein the cancer is prostate, cervical, kidney, stomach, 
skin, pancreatic, colon, bladder, breast, lung, testicular or ovarian cancer, and the test 
and normal tissue samples are selected from the group consisting of prostate tissue, 
kidney tissue, cervical tissue, skin tissue, stomach tissue, colon tissue, bladder tissue, 

5 breast tissue, lung tissue, pancreatic tissue, rectal tissue, ovarian tissue, testicular 

tissue, lymphatic tissue, serum, blood or semen. 

35. A pharmaceutical composition comprising a 20P2H8 polypeptide of claim of claim 
11 or an immunogenic portion thereof, and a physiologically acceptable carrier. 

36. A pharmaceutical composition comprising the vector of claim 25, and a 
10 physiologically acceptable carrier. 

37. A pharmaceutical composition comprising an antisense polynucleotide 
complementary to or a ribozyme capable of cleaving a polynucleotide of claim 1, 
and a physiologically acceptable carrier. 

38. A pharmaceutical composition comprising an antibody or fragment thereof of claim 
15 13, and a physiologically acceptable carrier. 

39. A method of treating a patient with a cancer that expresses 20P2H8 which 
comprises administering to said patient a vector according to claim 25, such that the 
vector delivers the single chain monoclonal antibody coding sequence to the cancer 
cells and the encoded single chain antibody is expressed intracellularly therein, 

20 40. A method of treating a patient with a cancer that expresses 20P2H8 which 
comprises administering to said patient a composition of claim 37. 

41. A vaccine composition for the treatment of a cancer expressing 20P2H8 comprising 
an immunogenic portion of a 20P2H8 protein and a physiologically acceptable 
carrier. 

25 42. A method of inhibiting the development of a cancer expressing 20P2H8 in a patient, 
comprising administering to the patient an effective amount of the vaccine 
composition of claim 40. 



79 



# # 

A method of identifying a molecule that modulates a biological activity of 20P2H8 
comprising: 



(a) contacting a molecule with a cell that expresses 20P2H8; 

(b) assaying a biological activity of 20P2H8 in the presence and absence of the 
molecule; and 

(c) determining whether the biological activity of 20P2H8 is altered by the 
presence of the molecule, an alteration in the biological activity of 20P2H8 
being indicative of a molecule that modulates a biological activity of 
20P2H8. 
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ABSTRACT 

The present invention relates to a novel protein designated 20P2H8 which shares 
homology with several heterogenous nuclear ribonucleoproteins (hnRNPs). A full length 
approximately 3600 bp 20P2H8 cDNA (SEQ ID NO: 10, encoding a 517 amino acid open 
reading frame (SEQ ID NO: 2), is provided herein. 
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solicit a United States patent. 
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<110> Daniel E.H. Afar 
Arthur B. Raitano 
Rene S. Hubert 
Steve Chappell Mitchell 
Ay a Jakobovits 

<12 0> NOVEL GENE UPREGULATED IN CANCERS OF THE 
PROSTATE 



<130> 129.21-US-U1 

<150> 60/162,364 
<151> 1999-10-28 



<160> 26 



<170> FastSEQ for Windows Version 4.0 



<210> 1 

<211> 3585 

<212> DNA 

<213> Homo Sapiens 



<400> 1 

ctttttggga tcactgctgg ggccaccggg gccaagctag gctcggatga gaaggagttg 60 

atcctgctgt tctggaaagt cgtggatctg gccaacaaga aggtgggaca gttgcacgaa ^™ 
gtgctagtta gaccggatca gttggaactg acggaggact gcaaagaaga aactaaaata 
gacgtcgaaa gcctgtcctc ggcgtcgcag ctggaccaag ccctccgaca gtttaaccag 
tcagtgagca atgaactgaa tattggagta gggacttcct tctgtctctg tactgatggg 

cagcttcatg tcaggcaaat cctgcatcct gaggcttcca agaagaatgt actattacct 360 

gaatgcttct attccttttt tgatcttcga aaagaattca agaaatgttg ccctggttca Ar >" 
cctgatattg acaaactgga cgttgccaca atgacagagt atttaaattt tgagaagagt 
agttcagtct ctcgatatgg agcctctcaa gttgaagata tggggaatat aattttagca 
atgatttcag agccttataa tcacaggttt tcagatccag agagagtgaa ttacaagttt 
gaaagtggaa cttgcagcaa gatggaactt attgatgata acaccgtagt cagggcacga 

ggtttaccat ggcagtcttc agatcaagat attgcaagat tcttcaaagg actcaatatt 720 

gccaagggag gtgcagcact ttgtctgaat gctcagggtc gaaggaacgg agaagctctg 780 

gttaggtttg taagtgagga gcaccgagac ctagcactac agaggcacaa acatcacatg QAn 
gggacccggt atattgaggt ttacaaagca acaggtgaag atttccttaa aattgctggt 
ggtacttcca atgaggtagc ccagtttctc tccaaggaaa atcaagtcat tgttcgcatg 

cgggggctcc ctttcacggc cacagctgaa gaagtggtgg ccttctttgg acagcattgc 1020 

cctattactg ggggaaagga aggcatcctc tttgtcacct acccagatgg taggccaaca inQn 
ggggacgctt ttgtcctctt tgcctgtgag gaatatgcac agaatgcgtt gaggaagcat 
aaagacttgt tgggtaaaag atacattgaa ctcttcagga gcacagcagc tgaagttcag 
caggtgctga atcgattctc ctcggcccct ctcattccac ttccaacccc tcccattatt 

ccagtactac ctcagcaatt tgtgccccct acaaatgtta gagactgtat acgccttcga 1320 

ggtcttccct atgcagccac aattgaggac atcctggatt tcctggggga gttcgccaca 1380 

gatattcgta ctcatggggt tcacatggtt ttgaatcacc agggccgccc atcaggagat 1440 

gcctttatcc agatgaagtc tgcggacaga gcatttatgg ctgcacagaa gtgtcataaa 1500 

aaaaacatga aggacagata tgttgaagtc tttcagtgtt cagctgagga gatgaacttt 1560 

gtgttaatgg ggggcacttt aaatcgaaat ggcttatccc caccgccatg cctgtctcct 1620 

ccctcctaca catttccagc tcctgctgca gttattccta cagaagctgc catttaccag 1680 

ccctctgtga ttttgaatcc acgagcactg cagccctcca cagcgtacta cccagcaggc 1740 

actcagctct tcatgaacta cacagcgtac tatcccagcc ccccaggttc gcctaatagt 1800 

cttggctact tccctacagc tgctaatctt agcggtgtcc ctccacagcc tggcacggtg 1860 

gtcagaatgc agggcctggc ctacaatact ggagttaagg aaattcttaa cttcttccaa 1920 

ggttaccagt atgcaaccga ggatggactt atacacacaa atgaccaggc caggactcta 1980 

cccaaagaat gggtttgtat ttaagggccc cagcagttag aacatcctca gaaaagaagt 2 04 0 

gtttgaaaga tgtatggtga tcttgaaacc tccagacaca agaaaacttc tagcaaattc 2100 

aggggaagtt tgtctacact caggctgcag tattttcagc aaacttgatt ggacaaacgg 2160 



120 
180 
240 
300 



420 
480 
540 
600 
660 



840 
900 
960 



1080 
1140 
1200 
1260 



1 



gcctgtgcct tatcttttgg tggagtgaaa aaatttgagc tagtgaagcc aaatcgtaac 2220 

ttacagcaag cagcatgcag catacctggc tctttgctga ttgcaaatag gcatttaaaa 2280 

tgtgaatttg gaatcagatg tctccattac ttccagttaa agtggcatca taggtgtttc 2340 

ctaagtttta agtcttggat aaaaactcca ccagtgtcta ccatctccac catgaactct 2400 

gttaaggaag cttcattttt gtatattccc gctcttttct cttcatttcc ctgtcttctg 2460 

cataatcatg ccttcttgct aagtaattca agcataagat cttggaataa taaaatcaca 2520 

atcttaggag aaagaataaa attgttattt tcccagtctc ttggccatga tgatatctta 2580 

tgattaaaaa caaattaaat tttaaaacac ctgaagataa attagaagaa attgtgcacc 2640 

ctccacaaaa catacaaagt ttaaaagttt ggatcttttt ctcagcaggt atcagttgta 2700 

aataatgaat taggggccaa aatgcaaaac gaaaaatgaa gcagctacat gtagttagta 2760 

atttctagtt tgaactgtaa ttgaatattg tggcttcata tgtattattt tatattgtac 2820 

ttttttcatt attgatggtt tggactttaa taagagaaat tccatagttt ttaatatccc 2880 

agaagtgaga caatttgaac agtgtattct agaaaacaat acactaactg aacagaagtg 294 0 

aatgcttata tatattatga tagccttaaa cctttttcct ctaatgcctt aactgtcaaa 3000 

taattataac cttttaaagc ataggactat agtcagcatg ctagactgag aggtaaacac 3060 

tgatgcaatt agaacaggta ctgatgctgt cagtgtttaa cactatgttt agctgtgttt 3120 

atgctataaa agtgcaatat tagacactag ctagtactgc tgcctcatgt aactccaaag 3180 

aaaacaggat ttcattaagt gcattgaatg tggatatttc tctaagttac tcatattgtc 3240 

ctttgcttga atgcaatgcc gtgcagattt atgaggctgc tatttttatt ttctgtgcat 3300 

tactttaaca ccttaaaggg agaagcaaac atttccttct tcagctgact ggcaatggcc 3360 

ctttaactgc aataggaaga aaaaaaaaaa ggtttgtgtg aaaattggtg ataactggca 342 0 

cttaagatcg aaaagaaatt tctgtatact tgatgcctta agatgcccaa agctgcccaa 3480 

agctctgaaa gactttaaga taggcagtaa tgcttactac aatactactg agtttttgta 3540 

gagttaacat ttgataataa aacttgcctg tttaatctca aaaaa 3585 

<210> 2 
<211> 517 
<212> PRT 

<213> Homo Sapiens 
<400> 2 

Met Thr Glu Tyr Leu Asn Phe Glu Lys Ser Ser Ser Val Ser Arg Tyr 

1 5 10 15 

Gly Ala Ser Gin Val Glu Asp Met Gly Asn He He Leu Ala Met He 

20 25 30 

Ser Glu Pro Tyr Asn His Arg Phe Ser Asp Pro Glu Arg Val Asn Tyr 

35 40 45 

Lys Phe Glu Ser Gly Thr Cys Ser Lys Met Glu Leu He Asp Asp Asn 

50 55 60 

Thr Val Val Arg Ala Arg Gly Leu Pro Trp Gin Ser Ser Asp Gin Asp 
65 70 75 80 

He Ala Arg Phe Phe Lys Gly Leu Asn He Ala Lys Gly Gly Ala Ala 

85 90 95 

Leu Cys Leu Asn Ala Gin Gly Arg Arg Asn Gly Glu Ala Leu Val Arg 

100 105 HO 

Phe Val Ser Glu Glu His Arg Asp Leu Ala Leu Gin Arg His Lys His 

115 120 125 

His Met Gly Thr Arg Tyr He Glu Val Tyr Lys Ala Thr Gly Glu Asp 

130 135 140 

Phe Leu Lys He Ala Gly Gly Thr Ser Asn Glu Val Ala Gin Phe Leu 
145 150 155 160 

Ser Lys Glu Asn Gin Val He Val Arg Met Arg Gly Leu Pro Phe Thr 

165 170 175 

Ala Thr Ala Glu Glu Val Val Ala Phe Phe Gly Gin His Cys Pro He 

180 185 190 

Thr Gly Gly Lys Glu Gly He Leu Phe Val Thr Tyr Pro Asp Gly Arg 

195 200 205 

Pro Thr Gly Asp Ala Phe Val Leu Phe Ala Cys Glu Glu Tyr Ala Gin 

210 215 220 

Asn Ala Leu Arg Lys His Lys Asp Leu Leu Gly Lys Arg Tyr He Glu 
225 230 235 240 

Leu Phe Arg Ser Thr Ala Ala Glu Val Gin Gin Val Leu Asn Arg Phe 
245 250 255 



2 



Ser Ser Ala Pro 
260 

Leu Pro Gin Gin 
275 

Leu Arg Gly Leu 
290 

Leu Gly Glu Phe 
305 

Leu Asn His Gin 

Ser Ala Asp Arg 
340 

Met Lys Asp Arg 
355 

Asn Phe Val Leu 
370 

Pro Pro Cys Leu 
385 

Val He Pro Thr 

Pro Arg Ala Leu 
420 

Leu Phe Met Asn 
435 

Asn Ser Leu Gly 
450 

Pro Gin Pro Gly 

465 

Gly Val Lys Glu 

Glu Asp Gly Leu 
500 

Glu Trp Val Cys 
515 



Leu He Pro Leu 

Phe Val Pro Pro 
280 

Pro Tyr Ala Ala 
295 

Ala Thr Asp He 
310 

Gly Arg Pro Ser 
325 

Ala Phe Met Ala 

Tyr Val Glu Val 
360 

Met Gly Gly Thr 
375 

Ser Pro Pro Ser 
390 

Glu Ala Ala He 
405 

Gin Pro Ser Thr 

Tyr Thr Ala Tyr 
440 

Tyr Phe Pro Thr 
455 

Thr Val Val Arg 

470 

He Leu Asn Phe 
485 

He His Thr Asn 
He 



Pro Thr Pro Pro 
265 

Thr Asn Val Arg 

Thr He Glu Asp 
300 

Arg Thr His Gly 
315 

Gly Asp Ala Phe 
330 

Ala Gin Lys Cys 
345 

Phe Gin Cys Ser 

Leu Asn Arg Asn 
380 

Tyr Thr Phe Pro 
395 

Tyr Gin Pro Ser 
410 

Ala Tyr Tyr Pro 
425 

Tyr Pro Ser Pro 

Ala Ala Asn Leu 
460 

Met Gin Gly Leu 
475 

Phe Gin Gly Tyr 
490 

Asp Gin Ala Arg 
505 



He He Pro Val 
270 

Asp Cys He Arg 
285 

He Leu Asp Phe 

Val His Met Val 
320 

He Gin Met Lys 
335 

His Lys Lys Asn 
350 

Ala Glu Glu Met 
365 

Gly Leu Ser Pro 

Ala Pro Ala Ala 
400 

Val He Leu Asn 
415 

Ala Gly Thr Gin 
430 

Pro Gly Ser Pro 

445 

Ser Gly Val Pro 

Ala Tyr Asn Thr 
480 

Gin Tyr Ala Thr 
495 

Thr Leu Pro Lys 
510 



<210> 3 
<211> 14 
<212> DNA 
<213> Synthetic 

<400> 3 

ttttgatcaa gctt 

<210> 4 
<211> 42 
<212> DNA 
<213> Synthetic 



<400> 4 

ctaatacgac tcactatagg gctcgagcgg ccgcccgggc ag 

<210> 5 
<211> 12 
<212> DNA 
<213> Synthetic 

<400> 5 
ggcccgtcct ag 



<210> 6 
<211> 40 
<212> DNA 
<213> Synthetic 



3 



<400> 6 

gtaatacgac tcactatagg gcagcgtggt cgcggccgag 



40 



<210> 7 
<211> 10 
<212> DNA 
<213> Synthetic 

<400> 7 

cggctcctag 10 

<210> 8 
<211> 22 
<212> DNA 
<213> Synthetic 

<400> 8 

ctaatacgac tcactatagg gc 22 

<210> 9 
<211> 22 
<212> DNA 
<213> Synthetic 

<400> 9 

tcgagcggcc gcccgggcag ga 22 

<210> 10 
<211> 20 
<212> DNA 
<213> Synthetic 

<400> 10 

agcgtggtcg cggccgagga 2 0 

<210> 11 
<211> 25 
<212> DNA 

<213> Homo Sapiens 
<400> 11 

atatcgccgc gctcgtcgtc gacaa 25 

<210> 12 

<211> 26 

<212> DNA 

<213> Homo Sapiens 

<400> 12 

agccacacgc agctcattgt agaagg 26 

<210> 13 
<211> 24 
<212> DNA 
<213> Synthetic 

<400> 13 

tcttgaaacc tccagacaca agaa 24 

<210> 14 
<211> 24 
<212> DNA 



4 



<213> Synthetic 



<400> 14 

, 04 
ggagatggta gacactggtg gagt 

<210> 15 
<211> 24 
<212> DNA 
<213> Synthetic 

<400> 15 

24 

tcttgaaacc tccagacaca agaa 

<210> 16 
<211> 23 
<212> DNA 
<213> Synthetic 

<400> 16 

aagttacgat ttggcttcac tgg 

<210> 17 
<21i> 9 
<212> PRT 
<213> Synthetic 

<400> 17 

Phe Leu Gly Glu Phe Ala Thr Asp He 
1 5 

<210> 18 

<211> 9 

<212> PRT 

<213> Synthetic 

<400> 18 

Val Leu Phe Ala Cys Glu Glu Tyr Ala 
1 5 

<210> 19 
<211> 9 
<212> PRT 
<213> Synthetic 

<400> 19 

Leu Leu Gly Lys Arg Tyr He Glu Leu 
1 5 

<210> 20 
<211> 9 
<212> PRT 
<213> Synthetic 

<400> 20 

Gin Gin Phe Val Pro Pro Thr Asn Val 
1 5 

<210> 21 
<211> 9 
<212> PRT 
<213> Synthetic 

<400> 21 



5 



Cys Ser Ala Glu Glu Met Asn Phe Val 
1 5 



<210> 22 

<211> 9 

<212> PRT 

<213> Synthetic 

<400> 22 

Phe Leu Ser Lys Glu Asn Gin Val lie 
1 5 

<210> 23 
<211> 9 
<212> PRT 
<213> Synthetic 

<400> 23 

Ser Leu Gly Tyr Phe Pro Thr Ala Ala 
1 5 

<210> 24 

<211> 9 

<212> PRT 

<213> Synthetic 

<400> 24 

Thr Leu Pro Lys Glu Trp Val Cys He 
1 5 

<210> 25 
<211> 9 
<212> PRT 
<213> Synthetic 

<400> 25 

Tyr Thr Phe Pro Ala Pro Ala Ala Val 
1 5 

<210> 26 
<211> 9 
<212> PRT 
<213> Synthetic 



<400> 26 

Tyr Gin Tyr Ala Thr Glu Asp Gly Leu 
1 5 



